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SEQUENCE LISTING 

(1) GENERAL INFORMATION : 

5 (i) APPLICANT: Genentech, Inc., Hsei, Vanessa 

Koumenis , Iphigenia 
Lecng, Steven R. 
Presta, Leonard G. 
Shahrokh , Zahra 
10 Zapata, Gerardo A. 

(ii) TITLE OF INVENTION: ANTIBODY FRAGMENT- POLYMER CONJUGATES 
AND HUMANIZED ANTI-IL-S MONOCLONAL ANTIBODIES 

15 (iii) NUMBER OF SEQUENCES: 72 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Genentech, Inc. 

(B) STREET: 1 DNA Way 

20 <C) CITY: South San Francisco 

<D) STATE: California 

(E) COUNTRY: USA 

(F) ZIP: 94080 

25 (v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 3.5 inch, 1.44 Mb floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: WinPatin (Genentech) 

30 

<vi} CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 20-Jan-1999 
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40 (vii) PRIOR APPLICATION DATA: 
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(B) FILING DATE: 24-JUL-1998 

(vii) PRIOR APPLICATION DATA: 
45 (A) APPLICATION NUMBER: 60/094013 

(B) FILING DATE: 24-JUL-1998 
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(A) APPLICATION NUMBER: 60/075467 
50 (B) FILING DATE: 20-FEB-1998 
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{B} REGISTRATION NUMBER: 34,659 
55 (C) REFERENCE/DOCKET NUMBER: P1085R4-1A 
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(ix) TELECOMMUNICATION INFORMATION : 

(A) TELEPHONE: 650/225-5530 

(B) TELEFAX: 650/952-9881 
(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:i: 



15 CAGTCCAACT GTTCAGGACG CC 22 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 22 base pairs 

(B) TYPE: Nucleic Acid 
^ : (C) STRANDEDNESS: Single 

11 (D) TOPOLOGY: Linear 

\p (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2 : 



135 



40 



45 



50 



GTGCTGCTCA TGCTGTAGGT GC 22 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

GAAGTTGATG TCTTGTGAGT GGC 23 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 24 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



55 



GCATCCTAGA GTCACCGAGG AGCC 24 



(2) INFORMATION FOR SEQ ID NO: 5: 
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35 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



CACTGGCTCA GGGAAATAAC CC 22 

(2) INFORMATION FOR SEQ ID NO: 6: 

15 (i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 22 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDN ESS : Single 

(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GGAGAGCTGG GAAGGTGTGC AC 22 
(2) INFORMATION FOR SEQ ID NO: 7: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 35 base pairs 
30 (B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 
<D) TOPOLOGY: Linear 



<\i) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

ACAAACGCGT ACGCTGACAT CGTCATGACC CAGTC 35 
(2) INFORMATION FOR SEQ ID NO: 8: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



50 ACAAACGCGT ACGCTGATAT TGTCATGACT CAGTC 35 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 35 base pairs 

<B) TYPE: Nucleic Acid 
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(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



ACAAACGCGT ACGCTGACAT CGTCATGACA CAGTC 35 
(2) INFORMATION FOR SEQ ID MO: 10: 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 
15 (D) TOPOLOGY: Linear 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



20 GCTCTTCGAA TGGTGGGAAG ATGGATACAG TTGGTGC 37 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 
25 {A) LENGTH: 39 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



CGATGGGCCC GGATAGACCG ATGGGGCTGT TGTTTTGGC 39 

35 (2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 39 base pairs 

(B) TYPE: Nucleic Acid 
40 (C) STRANDEDNESS : Single 

{D} TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

45 

CGATGGGCCC GGATAGACTG ATGGGGCTGT CGTTTTGGC 39 
(2) INFORMATION FOR SEQ ID NO: 13: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 39 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 
. (D) TOPOLOGY: Linear 

55 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 
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CGATGGGCCC GGATAGACGG ATGGGGCTGT TGTTTTGGC 39 
5 (2) INFORMATION FOR SEQ ID NO:14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: Nucleic Acid 
10 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

{xi} SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

15 

CGATGGGCCC GGATAGACAG ATGGGGCTGT TGTTTTGGC 39 
(2} INFORMATION FOR SEQ ID NO: 15: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



CGATGGGCCC GGATAGACTG ATGGGGCTGT TGTTTTGGC 39 

30 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 69 base pairs 
35 ' (B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

40 



GACATTGTCA 


TGACACAGTC 


TCAAAAATTC 


ATGTCCACAT 


CAGTAGGAGA 


50 


CAGGGTCAGC 


GTCACCTGCA 


AGGCCAGTCA 


GAATGTGGGT 


ACTAATGTAG 


100 


CCTGGTATCA 


ACAGAAACCA 


GGGCAATCTC 


CTAAAGCACT 


GATTTACTCG 


150 


TCATCCTACC 


GGTACAGTGG 


AGTCCCTGAT 


CGCTTCACAG 


GCAGTGGATC 


200 


TGGGACAGAT 


TTCACTCTCA 


CCATCAGCCA 


TGTGCAGTCT 


GAAGACTTGG 


250 


CAGACTATTT 


CTGTCAGCAA 


TATAACATCT 


ATCCTCTCAC 


GTTCGGTCCT 


300 


GGGACCAAGC 


TGGAGTTGAA 


ACGGGCTGAT 


GCTGCACCAC 


CAACTGTATC 


350 


CATCTTCCCA 


CCATTCGAA 369 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 
(BJ TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Asp lie Val Met Thr Gin Ser Gin Lys Phe Met Ser Thr Ser Val 
15 10 15 



Gly Asp Arg Val Ser Val Thr Cys Lys Ala Ser Gin Asn Val Gly 

15 20 25 30 

Thr Asn Val Ala Trp Tyr Gin Gin Lys Pro Gly Gin Ser Pro Lys 

35 40 45 

20 Ala Leu lie Tyr Ser Ser Ser Tyr Arg Tyr Ser Gly Val Pro Asp 

50 55 60 



Arg Phe Thr Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr lie 
65 70 75 

Ser His Val Gin Ser Glu Asp Leu Ala Asp Tyr Phe Cys Gin Gin 

80 85 90 



Tyr Asn lie Tyr Pro Leu Thr Phe Gly Pro Gly Thr Lys Leu Glu 
30 95 100 105 

Leu Lys Arg Ala Asp Ala Ala Pro Pro Thr Val Ser lie Phe Pro 
110 115 120 

35 Pro Phe Glu 
123 

(2) INFORMATION FOR SEQ ID NO: 18: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 417 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRAND EDNESS : Double 

(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



TTCTATTGCT ACAAACGCGT ACGCTGAGGT GCAGCTGGTG GAGTCTGGGG 50 

GAGGCTTAGT GCCGCCTGGA GGGTCCCTGA AACTCTCCTG TGCAGCCTCT 100 

GGATTCATAT TCAGTAGTTA TGGCATGTCT TGGGTTCGCC AGACTCCAGG 150 

55 CAAGAGCCTG GAGTTGGTCG CAACCATTAA TAATAATGGT GATAGCACCT 200 
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ATTATCCAGA CAGTGTGAAG GGCCGATTCA CCATCTCCCG AGACAATGCC 250 
AAGAACACCC TGTACCTGCA AATGAGCAGT CTGAAGTCTG AGGACACAGC 300 
CATGTTTTAC TGTGCAAGAG CCCTCATTAG TTCGGCTACT TGGTTTGGTT 350 
ACTGGGGCCA AGGGACTCTG GTCACTGTCT CTGCAGCCAA AACAACAGCC 400 
CCATCTGTCT ATCCGGG 417 
(2) INFORMATION FOR SEQ ID NO: 19: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 130 amino acids 
15 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

20 Glu Val Gin Leu Val Glu Ser Gly Gly Gly Leu Val Pro Pro Gly 
15 10 15 

Gly Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly Phe lie Phe Ser 
20 25 30 

Ser Tyr Gly Met Ser Trp Val Arg Gin Thr Pro Gly Lys Ser Leu 
35 40 45 



Glu Leu Val Ala Thr lie Asn Asn Asn Gly Asp Ser Thr Tyr Tyr 

30 50 55 60 

Pro Asp Ser Val Lys Gly Arg Phe Thr lie Ser Arg Asp Asn Ala 

65 70 75 

35 Lys* Asn Thr Leu Tyr Leu Gin Met Ser Ser Leu Lys Ser Glu Asp 

80 85 90 



Thr Ala Met Phe Tyr Cys Ala Arg Ala Leu lie Ser Ser Ala Thr 
95 100 105 

Trp Phe Gly Tyr Trp Gly Gin Gly Thr Leu Val Thr Val Ser Ala 
110 115 120 



Ala Lys Thr Thr Ala Pro Ser Val Tyr Pro 
45 125 130 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 31 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
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ACAAACGCGT AC G C TG AT AT CGTCATGACA G 31 
(2) INFORMATION FOR SEQ ID NO: 21: 

5 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 31 base pairs 
{B} TYPE: Nucleic Acid 
(C) STRANDEDNESS : Single 
10 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



15 GCAGCATCAG CTCTTCGAAG CTCCAGCTTG G 31 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 21 base pairs 

(B) TYPE: DNA 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 : 



CCACTAGTAC GCAAGTTCAC G 21 

30 {2} INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 33 base pairs 
* (B) TYPE: Nucleic Acid 
35 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

40 

GATGGGCCCT TGGTGGAGGC TGCAGAGACA GTG 33 
(2) INFORMATION FOR SEQ ID NO: 24: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 714 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



ATGAAGAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 50 

55 

TGCTACAAAC GCGTACGCTG ATATCGTCAT GACACAGTCT CAAAAATTCA 100 

250 
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TGTCCACATC AGTAGGAGAC AGGGTCAGCG TCACCTGCAA GGCCAGTCAG 150 
AATGTGGGTA CTAATGTAGC CTGGTATCAA CAGAAACCAG GGC AATCTCC 200 

5 

TAAAGCACTG ATTTACTCGT CATCCTACCG GTACAGTGGA GTCCCTGATC 250 
GCTTCACAGG CAGTGGATCT GGGACAGATT TCACTCTCAC CATCAGCCAT 300 
10 GTGCAGTCTG AAGACTTGGC AGACTATTTC TGTCAGCAAT ATAACATCTA 350 
TCCTCTCACG TTCGGTCCTG GGACCAAGCT GGAGCTTCGA AGAGCTGTGG 400 
CTGCACCATC TGTCTTCATC TTCCCGCCAT CTGATGAGCA GTTGAAATCT 450 

15 

GGAACTGCTT CTGTTGTGTG CCTGCTGAAT AACTTCTATC CCAGAGAGGC 500 
CAAAGTACAG TGGAAGGTGG ATAACGCCCT CCAATCGGGT AACTCCCAGG 550 
20 AGAGTGTCAC AGAGCAGGAC AGCAAGGACA GCACCTACAG CCTCAGCAGC 600 
ACCCTGACGC TGAGCAAAGC AGACTACGAG AAACACAAAG TCTACGCCTG 650 
CGAAGTCACC CATCAGGGCC TGAGCTCGCC CGTCACAAAG AGCTTCAACA 7 00 

25 

GGGGAGAGTG TTAA 714 
(2) INFORMATION FOR SEQ ID NO: 25: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

35 (xi) SEQUENCE DESCRIPTION : SEQ ID NO: 25: 

Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
15 10 15 

40 Ser He Ala Thr Asn Ala Tyr Ala Asp He Val Met Thr Gin Ser 

20 . 25 30 

Gin Lys Phe Met Ser Thr Ser Val Gly Asp Arg Val Ser Val Thr 
35 40 45 



45 



Cys Lys Ala Ser Gin Asn Val Gly Thr Asn Val Ala Trp Tyr Gin 
50 55 60 



Gin Lys Pro Gly Gin Ser Pro Lys Ala Leu He Tyr Ser Ser Ser 
50 65 70 75 

Tyr Arg Tyr Ser Gly Val Pro Asp Arg Phe Thr Gly Ser Gly Ser 
SO 85 90 

55 Gly Thr Asp Phe Thr Leu Thr He Ser His Val Gin Ser Glu Asp 

95 100 105 

251 
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Leu Ala Asp Tyr Phe Cys Gin Gin Tyr 

110 

5 Phe Gly Pro Gly Thr Lys Leu Glu Leu 

125 

Pro Ser Val Phe lie Phe Pro Pro Ser 
140 

10 

Gly Thr Ala Ser Val Val Cys Leu Leu 
155 

Glu Ala Lys Val Gin Trp Lys Val Asp 
15 170 

Asn Ser Gin Glu Ser Val Thr Glu Gin 
185 

20 Tyr Ser Leu Ser Ser Thr Leu Thr Leu 

200 

Lys His Lys Val Tyr Ala Cys Glu Val 

215 

25 

Ser Pro Val Thr Lys Ser Phe Asn Arg 
230 

(2) INFORMATION FOR SEQ ID NO: 26: 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 756 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRAND EDNESS : Double 
35 (D) TOPOLOGY : Linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 



40 ATGAAAAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 50 
TGCTACAAAC GCGTACGCTG AGGTGCAGCT GGTGGAGTCT GGGGGAGGCT 100 
TAGTGCCGCC TGGAGGGTCC CTGAAACTCT CCTGTGCAGC CTCTGGATTC 150 

45 

ATATTCAGTA GTTATGGCAT GTCTTGGGTT CGCCAGACTC CAGGCAAGAG 200 
CCTGGAGTTG GTCGCAACCA TTAATAATAA TGGTGATAGC ACCTATTATC 250 
50 CAGACAGTGT GAAGGGCCGA TTCACCATCT CCCGAGACAA TGCCAAGAAC 300 
ACCCTGTACC TGCAAATGAG CAGTCTGAAG TCTGAGGACA CAGCCATGTT 350 
TTACTGTGCA AGAGCCCTCA TTAGTTCGGC TACTTGGTTT GGTTACTGGG 400 

55 

GCCAAGGGAC TCTGGTCACT GTCTCTGCAG CCTCCACCAA GGGCCCATCG 450 

252 



Asn lie 
115 



Arg Arg 
130 



Asp Glu 
145 



Asn Asn 

160 

Asn Ala 
175 

Asp Ser 
190 

Ser Lys 
205 

Thr His 

220 

Gly Glu 

235 



Tyr Pro 

Ala Val 

Gin Leu 

Phe Tyr 

Leu Gin 

Lys Asp 

Ala Asp 

Gin Gly 

Cys 
237 



Leu Thr 

120 

Ala Ala 
135 



Lys Ser 
150 



Pro Arg 
165 

Ser Gly 

180 

Ser Thr 

195 

Tyr Glu 
210 

Leu Ser 
225 
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GTCTTCCCCC TGGCACCCTC CTCCAAGAGC ACCTCTGGGG GCACAGCGGC 500 
CCTGGGCTGC CTGGTCAAGG ACTACTTCCC CGAACCGGTG ACGGTGTCGT 550 

5 

GGAACTCAGG CGCCCTGACC AGCGGCGTGC ACACCTTCCC GGCTGTCCTA 600 
CAGTCCTCAG GACTCTACTC CCTCAGCAGC GTGGTGACCG TGCCCTCCAG 650 
10 CAGCTTGGGC ACCCAGACCT ACATCTGCAA CGTGAATCAC AAGCCCAGCA 700 
ACACCAAGGT GGACAAGAAA GTTGAGCCCA AATCTTGTGA CAAAACTCAC 750 
ACATGA 756 

15 

{2} INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 amino acids 
20 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

25 Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
15 10 15 

Ser lie Ala Thr Asn Ala Tyr Ala Giu Val Gin Leu Val Glu Ser 

20 25 30 

30 

Gly Gly Gly Leu Val Pro Pro Gly Gly Ser Leu Lys Leu Ser Cys 

35 40 45 

Ala .Ala Ser Gly Phe lie Phe Ser Ser Tyr Gly Met Ser Trp Val 
35 50 55 60 

Arg Gin Thr Pro Gly Lys Ser Leu Glu Leu Val Ala Thr He Asn 
65 70 75 

40 Asn Asn Gly Asp Ser Thr Tyr Tyr Pro Asp Ser Val Lys Gly Arg 

80 85 ■ 90 

Phe Thr He Ser Arg Asp Asn Ala Lys Asn Thr Leu Tyr Leu Gin 
95 100 105 

45 

Met Ser Ser Leu Lys Ser Glu Asp Thr Ala Met Phe Tyr Cys Ala 
110 115 120 

Arg Ala Leu He Ser Ser Ala Thr Trp Phe Gly Tyr Trp Gly Gin 
50 125 130 135 

Gly Thr Leu Val Thr Val Ser Ala Ala Ser Thr Lys Gly Pro Ser 
140 145 150 

55 Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr 

155 160 165 

253 
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Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val 
170 175 180 

5 Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr 

185 190 195 

Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser Leu Ser Ser 

200 205 210 

10 

Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gin Thr Tyr He 

215 220 225 

Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp Lys Lys 
15 230 235 240 

Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr 

245 250 251 

20 (2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: Nucleic Acid 
25 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

30 

CCAATGCATA CGCTGACATC GTGATGACCC AGACCCC 37 
(2) INFORMATION FOR SEQ ID NO: 29: 

35 (l) SEQUENCE CHARACTERISTICS : 

{A) LENGTH: 37 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 



CCAATGCATA CGCTGATATT GTGATGACTC AGACTCC 37 

45 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i} SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 37 base pairs 
50 (B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

55 
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CCAATGCATA CGCTGACATC GTGATGACAC AGACACC 37 
{2) INFORMATION FOR SEQ ID HO: 31: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

0 

(xi} SEQUENCE DESCRIPTION: SEQ ID NO:31: 



AGATGTCAAT TGCTCACTGG ATGGTGGGAA GATGG 35 

15 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 32 base pairs 
20 (B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 



CAAACGCGTA CGCTGAGATC CAGCTGCAGC AG 32 
(2) INFORMATION FOR SEQ ID NO: 33: 

30 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 32 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 
35 (D) TOPOLOGY : Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 



40 CAAACGCGTA CGCTGAGATT CAGCTCCAGC AG 32 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 3 91 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 {xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 



GATATCGTGA TGACACAGAC ACCACTCTCC CTGCCTGTCA GTCTTGGAGA SO 
55 TCAGGCCTCC ATCTCTTGCA GATCTAGTCA GAGCCTTGTA CACGGTATTG 100 
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GAAACACCTA TTTACATTGG TACCTGCAGA AGCCAGGCCA GTCTCCAAAG 150 
CTCCTGATCT ACAAAGTTTC CAACCGATTT TCTGGGGTCC CAGACAQGTT 200 
CAGTGGCAGT GGATCAGGGA CAGATTTCAC ACTCAGGATC AGCAGAGTGG 250 
AGGCTGAGGA TCTGGGACTT TATTTCTGCT CTCAAAGTAC ACATGTTCCG 300 
CTCACGTTCG GTGCTGGGAC CAAGCTGGAG CTGAAAC GGG CTGATGCTGC 350 
ACCAACTGTA TCCATCTTCC CACCATCCAG TGAGCAATTG A 391 
(2) INFORMATION FOR SEQ ID NO: 35: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Asp lie Val Met Thr Gin Thr Pro Leu Ser Leu Pro Val Ser Leu 
15 10 15 

25 Gly Asp Gin Ala Ser lie Ser Cys Arg Ser Ser Gin Ser Leu Val 

20 25 ' 30 

His Gly lie Gly Asn Thr Tyr Leu His Trp Tyr Leu Gin Lys Pro 
35 40 ' 45 



30 



45 



50 



Gly Gin Ser Pro Lys Leu Leu lie Tyr Lys Val Ser Asn Arg Phe 
50 55 60 



Ser .Gly Val Pro Asp Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp 

35 65 70 75 

Phe Thr Leu Arg lie Ser Arg Val Glu Ala Glu Asp Leu Gly Leu 

80 85 90 

40 Tyr Phe Cys Ser Gin Ser Thr His Val Pro Leu Thr Phe Gly Ala 

95 100 105 



Gly Thr Lys Leu Glu Leu Lys Arg Ala Asp Ala Ala Pro Thr Val 
110 115 120 

Ser lie Phe Pro Pro Ser Ser Glu Gin Leu Lys 
125 130 131 

(2) INFORMATION FOR SEQ ID NO: 36: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 405 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 
55 (D) TOPOLOGY: Linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 



GAGATTCAGC TGCAGCAGTC TGGACCTGAG CTGATGAAGC CTGGGGCTTC 50 

5 

AGTGAAGATA TCCTGCAAGG CTTCTGGTTA TTCATTCAGT AGCCACTACA 100 
TGCACTGGGT GAAGCAGAGC CATGGAAAGA GCCTTGAGTG GATTGGCTAC 150 
10 ATTGATCCTT CCAATGGTGA AACTACTTAC AACCAGAAAT TCAAGGGCAA 200 
GGCCACATTG ACTGTAGACA CATCTTCCAG CACAGCCAAC GTGCATCTCA 250 
GCAGCCTGAC ATCTGATGAC TCTGCAGTCT ATTTCTGTGC AAGAGGGGAC 300 

15 

TATAGATACA ACGGCGACTG GTTTTTCGAT GTCTGGGGCG CAGGGACCAC 350 

GGTCACCGTC TCCTCCGCCA AAACCGACAG CCCCATCGGT CTATCCGGGC 400 

20 CCATC 405 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 13 5 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37 : 

30 

Glu He Gin Leu Gin Gin 

1 5 

Ala Ser Val Lys He Ser 
35 " 20 

Ser His Tyr Met His Trp 
35 

40 Glu Trp He Gly Tyr He 

50 

Asn Gin Lys Phe Lys Gly 
65 

45 

Ser Ser Thr Ala Asn Val 
80 

Ser Ala Val Tyr Phe Cys 
50 95 

Asp Trp Phe Phe Asp Val 

110 

55 Ser Ser Ala Lys Thr Asp 

125 



Ser Gly 

Cys Lys 
Val Lys 
Asp Pro 
Lys Ala 
His Leu 
Ala Arg 
Trp Gly 
Ser Pro 



Pro Glu Leu 
10 

Ala Ser Gly 

25 

Gin Ser His 
40 

Ser Asn Gly 
55 

Thr Leu Thr 
70 

Ser Ser Leu 
85 

Gly Asp Tyr 

100 

Ala Gly Thr 
115 

He Gly Leu 
130 



Met Lys Pro Gly 
15 

Tyr Ser Phe Ser 

30 

Gly Lys Ser Leu 
45 

Glu Thr Thr Tyr 

60 

Val Asp Thr Ser 
75 

Thr Ser Asp Asp 
90 

Arg Tyr Asn Gly 
105 

Thr Val Thr Val 

120 

Ser Gly Pro He 
135 
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(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 22 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 
CD) TOPOLOGY: Linear 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 



CTTGGTGGAG GCGGAGGAGA CG 22 
15 (2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: Nucleic Acid 
20 {C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

25 

GAAACGGGCT GTTGCTGCAC CAACTGTATT CATCTTCC 38 

(2) INFORMATION FOR SEQ ID NO: 40: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 31 base pairs 
<B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



35 



40 



50 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

GTCACCGTCT CCTCCGCCTC CACCAAGGGC C 31 
(2) INFORMATION FOR SEQ ID NO: 41: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 729 base pairs 
45 (B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

ATGAAGAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 50 

TGCTACAAAT GCATACGCTG ATATCGTGAT GACACAGACA CCACTCTCCC 100 

TGCCTGTCAG TCTTGGAGAT CAGGCCTCCA TCTCTTGCAG ATCTAGTCAG 150 
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AGCCTTGTAC ACGGTATTGG AAACACCTAT TTACATTGGT ACCTGCAGAA 200 

GCCAGGCCAG TCTCCAAAGC TCCTGATCTA CAAAGTTTCC AACCGATTTT 250 

5 

CTGGGGTCCC AGACAGGTTC AGTGGCAGTG GATCAGGGAC AGATTTCACA 300 

CTCAGGATCA GCAGAGTGGA GGCTGAGGAT CTGGGACTTT ATTTCTGCTC 350 

10 TCAAAGTACA CATGTTCCGC TCACGTTCGG TGCTGGGACC AAGCTGGAGC 400 

TGAAACGGGC TGTTGCTGCA CCAACTGTAT TCATCTTCCC ACCATCCAGT 450 

GAGCAATTGA AATCTGGAAC TGCCTCTGTT GTGTGCCTGC TGAATAACTT 500 

15 

CTATCCCAGA GAGGCCAAAG TACAGTGGAA GGTGGATAAC GCCCTCCAAT 550 

CGGGTAACTC CCAGGAGAGT GTCACAGAGC AGGACAGCAA GG AC AGCACC 600 

20 TACAGCCTCA GCAGCACCCT GACGCTGAGC AAAGCAGACT ACGAGAAACA 650 

CAAAGTCTAC GCCTGCGAAG TCACCCATCA GGGCCTGAGC TCGCCCGTCA 700 
CAAAGAGCTT CAACAGGGGA GAGTGTTAA 729 

25 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 amino acids 
30 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

35 Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
15 10 15 

Ser lie Ala Thr Asn Ala Tyr Ala Asp lie Val Met Thr Gin Thr 
20 25 30 

40 

Pro Leu Ser Leu Pro Val Ser Leu Gly Asp Gin Ala Ser lie Ser 
35 40 45 

Cys Arg Ser Ser Gin Ser Leu Val His Gly lie Gly Asn Thr Tyr 
45 50 55 60 

Leu His Trp Tyr Leu Gin Lys Pro Gly Gin Ser Pro Lys Leu Leu 
65 70 75 

50 He Tyr Lys Val Ser Asn Arg Phe Ser Gly Val Pro Asp Arg Phe 

80 85 90 

Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Arg He Ser Arg 
95 100 105 

55 

Val Glu Ala Glu Asp Leu Gly Leu Tyr Phe Cys Ser Gin Ser Thr 
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110 115 120 

His Val Pro Leu Thr Phe Gly Ala Gly Thr Lys Leu Glu Leu Lvs 
125 130 135 

5 

Arg Ala Val Ala Ala Pro Thr Val Phe lie Phe Pro Pro Ser Ser 
140 145 150 

Glu Gin Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn 
10 155 160 165 

Asn Phe Tyr Pro Arg Glu Ala Lys Val Gin Trp Lys Val Asp Asn 
170 175 180 

15 Ala Leu Gin Ser Gly Asn Ser Gin Glu Ser Val Thr Glu Gin Asp 

185 190 195 



20 



40 



Ser Lys Asp Ser Thr Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser 
200 205 210 

Lys Ala Asp Tyr Glu Lys His Lys Val Tyr Ala Cys Glu Val Thr 
215 220 225 



His Gin Gly Leu Ser Ser Pro Val Thr Lys Ser Phe Asn Arg Gly 
25 230 235 240 

Glu Cys 
242 

30 (2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 762 base pairs 

(B) TYPE: Nucleic Acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 



ATGAAAAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 50 
TGCTACAAAC GCGTACGCTG AGATTCAGCT GCAGCAGTCT GGACCTGAGC 100 
45 TGATGAAGCC TGGGGCTTCA GTGAAGATAT CCTGCAAGGC TTCTGGTTAT 150 
TCATTCAGTA GCCACTACAT GCACTGGGTG AAGCAGAGCC ATGGAAAGAG 200 
CCTTGAGTGG ATTGGCTACA TTGATCCTTC CAATGGTGAA ACTACTTACA 2 50 

50 

ACCAGAAATT CAAGGGCAAG GCCACATTGA CTGTAGACAC ATCTTCCAGC 300 
ACAGCCAACG TGCATCTCAG CAGCCTGACA TCTGATGACT CTGCAGTCTA 350 
55 TTTCTGTGCA AGAGGGGACT ATAGATACAA CGGCGACTGG TTTTTCGATG 400 
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TCTGGGGCGC AGGGACCACG GTCACCGTCT CCTCCGCCTC CACCAAGGGC 450 
CCATCGGTCT TCCCCCTGGC ACCCTCCTCC AAGAGCACCT CTGGGGGCAC 500 
5 AGCGGCCCTG GGCTGCCTGG TCAAGGACTA CTTCCCCGAA CCGGTGACGG 550 
TGTCGTGGAA CTCAGGCGCC CTGACCAGCG GCGTGCACAC CTTCCCGGCT 600 
GTCCTACAGT CCTCAGGACT CTACTCCCTC AGCAGCGTGG TGACCGTGCC 650 

10 

CTCCAGCAGC TTGGGCACCC AGACCTACAT CTGCAACGTG AATCACAAGC 700 

CCAGCAACAC CAAGGTGGAC AAGAAAGTTG AGCCCAAATC TTGTGACAAA 750 

15 ACTCACACAT GA 762 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 253 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

25 

Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
1 5 10 . 15 

Ser lie Ala Thr Asn Ala Tyr Ala Glu lie Gin Leu Gin Gin Ser 
30 20 25 30 

Giy Pro Glu Leu Met Lys Pro Gly Ala Ser Val Lys lie Ser Cys 
35 40 45 

35 Lys Ala Ser Gly Tyr Ser Phe Ser Ser His Tyr Met His Trp Val 

50 55 60 

Lys Gin Ser His Gly Lys Ser Leu Glu Trp lie Gly Tyr lie Asp 

65 70 75 

40 

Pro Ser Asn Gly Glu Thr Thr Tyr Asn Gin Lys Phe Lys Gly Lys 

80 85 90 

Ala Thr Leu Thr Val Asp Thr Ser Ser Ser Thr Ala Asn Val His 

45 95 100 105 

Leu Ser Ser Leu Thr Ser Asp Asp Ser Ala Val Tyr Phe Cys Ala 
110 115 120 

50 Arg Gly Asp Tyr Arg Tyr Asn Gly Asp Trp Phe Phe Asp Val Trp 

125 130 135 

Gly Ala Gly Thr Thr Val Thr Val Ser Ser Ala Ser Thr Lys Gly 

140 145 150 

55 

Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly 
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155 160 165 

Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu 
170 175 180 

5 

Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val 
135 190 195 

His Thr Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser Leu 
10 200 205 210 

Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gin Thr 
215 220 225 

15 Tyr lie Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp 

230 235 240 



20 



35 



50 



55 



Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr 
245 250 253 

(2) INFORMATION FOR SEQ ID NO: 45: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids 
25 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

30 Asp lie Val Met Thr Gin Thr Pro Leu Ser Leu Pro Val Ser Leu 
15 10 15 

Gly Asp Gin Ala Ser lie Ser Cys Arg Ser Ser Gin Ser Leu Val 
20 25 30 



His Gly lie Gly Asn Thr Tyr Leu His Trp Tyr Leu Gin Lys Pro 
35 40 45 



Gly Gin Ser Pro Lys Leu Leu lie Tyr Tyr Lys Val Ser Asn Arg 

40 50 55 60 

Phe Ser Gly Val Pro Asp Arg Phe Ser Asp Ser Gly Ser Gly Thr 

65 70 75 

45 Asp Phe Thr Leu Arg lie Ser Arg Val Glu Ala Glu Asp Leu Gly 

80 85 90 



Leu Tyr Phe Cys Ser Gin Ser Thr His Val Pro Leu Thr Phe Gly 
95 100 105 

Ala Gly Thr Lys Leu Glu Leu Lys Arg 
110 114 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 114 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID HO: 46: 

Asp lie Gin Met Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val 
15 10 15 

10 Gly Asp Arg Val Thr lie Thr Cys Arg Ser Ser Gin Ser Leu Val 

20 25 30 



15 



30 



45 



His Gly lie Gly Asn Thr Tyr Leu His Trp Tyr Gin Gin Lys Pro 
35 40 45 

Gly Lys Ala Pro Lys Leu Leu lie Tyr Tyr Lys Val Ser Asn Arg 
50 55 60 



Phe Ser Gly Val Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr 

20 65 70 75 

Asp Phe Thr Leu Thr lie Ser Ser Leu Gin Pro Glu Asp Phe Ala 

80 85 90 

25 Thr Tyr Tyr Cys Ser Gin Ser Thr His Val Pro Leu Thr Phe Gly 

95 100 105 



Gin Gly Thr Lys Val Glu He Lys Arg 
110 114 

(2) INFORMATION FOR SEQ ID NO: 47: 



(i) SEQUENCE CHARACTERISTICS: 

. (A) LENGTH: 109 amino acids 
35 (B) TYPE: PRT 

<D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

40 Asp He Gin Met Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val 
15 10 15 

Gly Asp Arg Val Thr He Thr Cys Arg Ala Ser Lys Thr He Ser 
20 25 30 



Lys Tyr Leu Ala Trp Tyr Gin Gin Lys Pro Gly Lys Ala Pro Lys 
35 40 45 



Leu Leu He Tyr Tyr Ser Gly Ser Thr Leu Glu Ser Gly Val Pro 
50 50 55 60 



Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr 

65 70 75 

55 He Ser Ser Leu Gin Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Gin 

80 85 90 
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Gin His Asn Glu Tyr Pro Leu Thr Phe Gly Gin Gly Thr Lys Val 
95 100 105 

5 Glu lie Lys Arg 
109 

(2) INFORMATION FOR SEQ ID NO: 48: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 
<B) TYPE: Amine Acid 
(D) TOPOLOGY: Linear 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Glu lie Gin Leu Gin Gin Ser Gly Pro Glu Leu Met Lys Pro Gly 
15 10 15 

20 Ala Ser Val Lys lie Ser Cys Lys Ala Ser Gly Tyr Ser Phe Ser 

20 25 30 



25 



40 



55 



Ser His Tyr Met His Trp Val Lys Gin Ser His Gly Lys Ser Leu 
35 40 45 

Glu Trp lie Gly Tyr He Asp Pro Ser Asn Gly Glu Thr Thr Tyr 
50 55 60 



Asn Gin Lys Phe Lys Gly Lys Ala Thr Leu Thr Val Asp Thr Ser 

30 65 70 75 

Ser Ser Thr Ala Asn Val His Leu Ser Ser Leu Thr Ser Asp Asp 

80 85 90 

35 Ser Ala Val Tyr Phe Cys Ala Ala Arg Gly Asp Tyr Arg Tyr Asn 

95 100 ~ 105 



Gly Asp Trp Phe Phe Asp Val Trp Gly Ala Gly Thr 
110 115 117 

(2) INFORMATION FOR SEQ ID NO: 49: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 
45 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

50 Glu Val Gin Leu Val Glu Ser Gly Gly Gly Leu Val Gin Pro Gly 
15 10 15 



Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Tyr Ser Phe Ser 

20 25 30 

Ser His Tyr Met His Trp Val Arg Gin Ala Pro Gly Lys Gly Leu 
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35 40 45 

Glu Trp Val Gly Tyr lie Asp Pro Ser Asn Gly Glu Thr Thr Tyr 

50 55 60 

5 

Asn Gin Lys Phe Lys Gly Arg Phe Thr lie Ser Arg Asp Asn Ser 

65 70 75 

Lys Asn Thr Leu Tyr Leu Gin Met Asn Ser Leu Arg Ala Glu Asp 

10 80 85 90 

Thr Ala Val Tyr Tyr Cys Ala Ala Arg Gly Asp Tyr Arg Tyr Asn 

95 100 105 

15 Gly Asp Trp Phe Phe Asp Val Trp Gly Gin Gly Thr 

110 115 117 

(2) INFORMATION FOR SEQ ID NO: 50: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 amino acids 

(B) TYPE: PRT 

<D) TOPOLOGY: Linear 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Glu Val Gin Leu Val Glu Ser Gly Gly Gly Leu Val Gin Pro Gly 

15 10 15 

30 Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Ser Phe Thr 

20 25 30 



35 



50 



Gly His Trp Met Asn Trp Val Arg Gin Ala Pro Gly Lys Gly Leu 

35 40 45 

Glu Trp Val Gly Met lie His Pro Ser Asp Ser Glu Thr Arg Tyr 

50 55 60 



Ala Asp Ser Val Lys Gly Arg Phe Thr lie Ser Arg Asp Asn Ser 

40 65 70 75 

Lys Asn Thr Leu Tyr Leu Gin Met Asn Ser Leu Arg Ala Glu Asp 

80 85 90 

45 Thr Ala Val Tyr Tyr Cys Ala Ala Arg Gly lie Tyr Phe Tyr Gly 

95 100 105 



Thr Thr Tyr Phe Asp Tyr Trp Gly Gin Gly Thr 
110 115 116 

(2) INFORMATION FOR SEQ ID NO: 51: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 amino acids 
55 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51; 

Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
5 1 5 10 15 

Ser He Ala Thr Asn Ala Tyr Ala Asp lie Gin Met Thr Gin Ser 
20 25 30 

10 Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg Val Thr lie Thr 

35 40 45 



15 



30 



45 



Cys Arg Ser Ser Gin Ser Leu Val His Gly He Gly Asn Thr Tyr 
50 55 60 

Leu His Trp Tyr Gin Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu 

65 70 75 



He Tyr Lys Val Ser Asn Arg Phe Ser Gly Val Pro Ser Arg Phe 

20 80 85 90 

Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr He Ser Ser 

95 100 105 

25 Leu Gin Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Ser Gin Ser Thr 

110 115 120 



His Val Pro Leu Thr Phe Gly Gin Gly Thr Lys Val Glu He Lys 
125 130 135 

Arg Thr Val Ala Ala Pro Ser Val Phe He Phe Pro Pro Ser Asp 
140 145 150 



Glu % Gln Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn 

35 * 155 160 165 

Asn Phe Tyr Pro Arg Glu Ala Lys Val Gin Trp Lys Val Asp Asn 

170 175 180 

40 Ala Leu Gin Ser Gly Asn Ser Gin Glu Ser Val Thr Glu Gin Asp 

185 190 195 



Ser Lys Asp Ser Thr Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser 

200 205 210 

Lys Ala Asp Tyr Glu Lys His Lys Val Tyr Ala Cys Glu Val Thr 

215 220 225 



His Gin Gly Leu Ser Ser Pro Val Thr Lys Ser Phe Asn Arg Gly 
50 230 235 240 

Glu Cys 
242 

55 (2) INFORMATION FOR SEQ ID NO: 52: 
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{ i ) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 253 amino acids 
(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe 

15 10 15 

10 

Ser He Ala Thr Asn Ala Tyr Ala Giu Val Gin Leu Val Gin Ser 

20 25 30 

Gly Gly Gly Leu Val Gin Pro Giy Gly Ser Leu Arg Leu Ser Cys 
15 35 40 45 

Ala Ala Ser Gly Tyr Ser Phe Ser Ser His Tyr Met His Trp Val 

50 55 60 

20 Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val Gly Tyr lie Asp 

65 70 75 



Pro Ser Asn Gly Glu Thr Thr Tyr Asn Gin Lys Phe Lys Gly Arg 

80 85 90 

25 

Phe Thr Leu Ser Arg Asp Asn Ser Lys Asn Thr Ala Tyr Leu Gin 

95 100 , 105 

Met Asn Ser Leu Arg Ala Glu Asp Thr "Ala Val Tyr Tyr Cys Ala 

30 110 115 120 

Arg Gly Asp Tyr Arg Tyr Asn Gly Asp Trp Phe Phe Asp Val Trp 

125 130 135 

35 Gly Gin Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly 

140 145 150 



40 



55 



Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly 

155 160 165 

Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu 

170 175 180 



Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Giy Val 
45 185 190 195 

His Thr Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser Leu 
200 205 210 

50 Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gin Thr 

215 220 225 



Tyr He Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp 

230 235 240 

Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr 

267 



Patent Docket No. PI085R4-1 A 



245 250 253 

(2) INFORMATION FOR SEQ ID NO: 53: 

5 <i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 159 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

Ser Gly Gly Gly Ser Gly Ser Gly Asp Phe Asp Tyr Glu Lys Met 
15 10 15 

15 Ala Asn Ala Asn Lys Gly Ala Met Thr Glu Asn Ala Asp Glu Asa 

20 25 30 



20 



35 



Ala Leu Gin Ser Asp Ala Lys Gly Lys Leu Asp Ser Val Ala Thr 

35 40 45 

Asp Tyr Gly Ala Ala lie Asp Gly Phe lie Gly Asp Val Ser Gly 

50 55 60 



Leu Ala Asn Gly Asn Gly Ala Thr Gly Asp Phe Ala Gly Ser Ser 

25 65 70 ^ 75 

Asn Ser Gin Met Ala Gin Val Gly Asp Gly Asp Asn Ser Pro Leu 

80 85 90 

30 Met Asn Asn Phe Arg Gin Tyr Leu Pro Ser Leu Pro Gin Ser Val 

95 100 105 



Glu Cys Arg Pro Phe Val Phe Ser Ala Gly Lys Pro Tyr Glu Phe 
110 115 120 

Ser lie Asp Cys Asp Lys lie Asn Leu Phe Arg Gly Val Phe Ala 
125 130 135 



Phe Leu Leu Tyr Val Ala Thr Phe Met Tyr Val Phe Ser Thr Phe 
40 140 145 150 

Ala Asn lie Leu Arg Asn Lys Glu Ser 
155 159 

45 - (2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 780 base pairs 
{ B) TYPE: Nucleic Acid 
50 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

55 

ATGAAAAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 50 

268 



^ J 



Patent Docket No. P1085R4-1A 



TGCTACAAAC GCATACGCTG ATATCCAGAT GACCCAGTCC CCGAGCTCCC 100 
TGTCCGCCTC TGTGGGCGAT AGGGTCACCA TCACCTGCAG GTCAAGTCAA 150 

5 

AGCTTAGTAC ATGGTATAGG TAACACGTAT TT AC AC TGGT ATCAACAGAA 200 
ACCAGGAAAA GCTCCGAAAC TACTGATTTA CAAAGTATCC AATCGATTCT 250 
10 CTGGAGTCCC TTCTCGCTTC TCTGGATCCG GTTCTGGGAC GG ATTTCACT 300 
CTGACCATCA GCAGTCTGCA GCCAGAAGAC TTCGCAACTT ATTACTGTTC 350 
ACAGAGTACT CATGTCCCGC TCACGTTTGG ACAGGGTACC AAGGTGGAGA 400 

15 

TCAAACGAAC TGTGGCTGCA CCATCTGTCT TCATCTTCCC GCCATCTGAT 450 
GAGCAGTTGA AATCTGGAAC TGCTTCTGTT GTGTGCCTGC TGAATAACTT 500 
20 CTATCCCAGA GAGGCCAAAG TACAGTGGAA GGTGGATAAC GCCCTCCAAT 550 
CGGGTAACTC CCAGGAGAGT GTCACAGAGC AGGACAGCAA GGACAGCACC 600 
TACAGCCTCA GCAGCACCCT GACGCTGAGC AAAGCAGACT ACGAGAAACA 650 

25 

CAAAGTCTAC GCCTGCGAAG TCACCCATCA GGGCCTGAGC TCGCCCGTCA 700 

CAAAGAGCTT CAACAGGGGA GAGTGTTAAG CTGATCCTCT ACGCCGGACG 750 

30 CATCGTGGCC CTAGTACGCA ACTAGTCGTA 7 80 

(2) INFORMATION FOR SEQ ID NO: 55: 

Li) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 2 53 amino acids 

<B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

40 

Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
15 10 15 

Ser lie Ala Thr Asn Ala Tyr Ala Glu Val Gin Leu Val Giu Ser 
45 20 25 30 

Gly Gly Gly Leu Val Gin Pro Gly Gly Ser Leu Arg Leu Ser Cys 
35 40 45 

50 Ala Ala Ser Gly Tyr Ser Phe Ser Ser His Tyr Met His Trp Val 

50 55 t 60 

Lys Gin Ala Pro Gly Lys Gly Leu Glu Trp Val Gly Tyr He Asp 
65 70 75 

55 

Pro Ser Asn Gly Glu Thr Thr Tyr Asn Gin Lys Phe Lys Gly Arg 
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80 85 90 

Phe Thr Leu Ser Arg Asp Asn Ser Lys Asn Thr Ala Tyr Leu Gin 
95 100 105 

5 

Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ala 
110 115 120 

Arg Gly Asp Tyr Arg Tyr Asn Gly Asp Trp Phe Phe Asp Val Trp 
10 125 130 135 

Gly Gin Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly 
140 145 150 

15 Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly 

155 160 165 



20 



1:35 



50 



Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu 
170 175 180 

Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val 
185 190 195 



His Thr Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser Leu 
J 25 200 205 210 

Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gin Thr 

| 215 220 225 

'30 Tyr lie Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp 

230 235 240 



Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr 
245 250 253 

{2} INFORMATION FOR SEQ ID NO: 56: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 amino acids 
40 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

45 Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
15 10 15 

Ser lie Ala Thr Asn Ala Tyr Ala Asp lie Gin Met Thr Gin Ser 
20 25 30 



Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg Val Thr lie Thr 
35 40 45 



Cys Arg Ser Ser Gin Ser Leu Val His Gly He Gly Ala Thr Tyr 
55 50 55 60 
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Leu Kis Trp Tyr Gin Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu 

65 70 75 

lie Tyr Lys Val Ser Asn Arg Phe Ser Gly Val Pro Ser Arg Phe 

5 80 85 90 

Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr lie Ser Ser 

95 100 105 

10 Leu Gin Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Ser Gin Ser Thr 

110 115 120 



15 



30 



His Val Pro Leu Thr Phe Gly Gin Gly Thr Lys Val Glu lie Lys 

125 130 135 

Arg Thr Val Ala Ala Pro Ser Val Phe He Phe Pro Pro Ser Asp 

140 145 150 



Glu Gin Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn 

20 155 160 165 

Asn Phe Tyr Pro Arg Glu Ala Lys Val Gin Trp Lys Val Asp Asn 

170 175 180 

25 Ala Leu Gin Ser Gly Asn Ser Gin Glu Ser Val Thr Glu Gin Asp 

185 190 195 



Ser Lys Asp Ser Thr Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser 

200 205 210 

Lys Ala Asp Tyr Glu Lys His Lys Val Tyr Ala Cys Glu Val Thr 

215 220 225 



His Gin Gly Leu Ser Ser Pro Val Thr Lys Ser Phe Asn Arg Gly 
35 230 235 240 

Glu Cys 
242 

40 (2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 45 amino acids 
<B) TYPE: Amino Acid 
45 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly Arg Met Lys 
50 1 5 10 15 

Gin Leu Glu Asp Lys Val Glu Glu Leu Leu Ser Lys Asn Tyr His 
20 25 30 

55 Leu Glu Asn Glu Val Ala Arg Leu Lys Lys Leu Val Gly Glu Arg 

35 40 45 



271 



Patent Docket No. P1085R4-1A 



(2) INFORMATION FOR SEQ 10 NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH : 7 80 base pairs 

<B) TYPE: Nucleic Acid 

(C) S TRANDEDN ESS: Single 

(D) TOPOLOGY: Linear 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 



ATGAAAAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 50 
15 TGCTACAAAC GCATACGCTG ATATCCAGAT GACCCAGTCC CCGAGCTCCC 100 
TGTCCGCCTC TGTGGGCGAT AGGGTCACCA TCACCTGCAG GTCAAGTCAA 150 
AGCTTAGTAC ATGGTATAGG TGCTACGTAT TTACACTGGT ATCAACAGAA 200 

20 

ACCAGGAAAA GCTCCGAAAC TACTGATTTA CAAAGTATCC AATCGATTCT 250 
CTGGAGTCCC TTCTCGCTTC TCTGGATCCG GTTCTGGGAC GGATTTCACT 300 
25 CTGACCATCA GCAGTCTGCA GCCAGAAGAC TTCGCAACTT ATTACTGTTC 350 
ACAGAGTACT CATGTCCCGC TCACGTTTGG ACAGGGTACC AAGGTGGAGA 400 
TCAAACGAAC TGTGGCTGCA CCATCTGTCT TCATCTTCCC GCCATCTGAT 450 

30 

GAGCAGTTGA AATCTGGAAC TGCTTCTGTT GTGTGCCTGC TGAATAACTT 500 
CTATCCCAGA GAGGCCAAAG TACAGTGGAA GGTGGATAAC GCCCTCCAAT 550 
35 CGGGTAACTC CCAGGAGAGT GTCACAGAGC AGGACAGCAA GGACAGCACC 600 
TACAGCCTCA GCAGCACCCT GACGCTGAGC AAAGCAGACT ACGAGAAACA 650 
CAAAGTCTAC GCCTGCGAAG TCACCCATCA GGGCCTGAGC TCGCCCGTCA 700 

40 

CAAAGAGCTT CAACAGGGGA GAGTGTTAAG CTGATCCTCT ACGCCGGACG 750 
CATCGTGGCC CTAGTACGCA ACTAGTCGTA 780 
45 (2) INFORMATION FOR SEQ ID NO: 59: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 927 base pairs 

(B) TYPE: Nucleic Acid 
50 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

AAAAGGGTAT CTAGAGGTTG AGGTGATTTT ATGAAAAAGA ATATCGCATT 50 
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TCTTCTTGCA TCTATGTTCG TFTTTTCTAT TGCTACAAAC GCGTACGCTG 100 
AGGTTCAGCT AGTGCAGTCT GGCGGTGGCC TGGTGCAGCC AGGGGGCTCA 150 

5 

CTCCGTTTGT CCTGTGCAGC TTCTGGCTAC TCCTTCTCGA GTCACTATAT 200 



GCACTGGGTC CGTCAGGCCC CGGGTAAGGG CCTGGAATGG GTTGGATATA 250 
10 TTGATCCTTC CAATGGTGAA ACTACGTATA ATCAAAAGTT CAAGGGCCGT 300 
TTCACTTTAT CTCGCGACAA CTCCAAAAAC ACAGCATACC TGCAGATGAA 350 
CAGCCTGCGT GCTGAGGACA CTGCCGTCTA TTACTGTGCA AGAGGGGATT 400 

15 

ATCGCTACAA TGGTGACTGG TTCTTCGACG TCTGGGGTCA AGGAACCCTG 450 
GTCACCGTCT CCTCGGCCTC CACCAAGGGC CCATCGGTCT TCCCCCTGGC 500 
20 ACCCTCCTCC AAGAGCACCT CTGGGGGCAC AGCGGCCCTG GGCTGCCTGG 550 
TCAAGGACTA CTTCCCCGAA CCGGTGACGG TGTCGTGGAA CTCAGGCGCC 600 
CTGACCAGCG GCGTGCACAC CTTCCCGGCT GTCCTACAGT CCTCAGGACT 650 

25 

CTACTCCCTC AGCAGCGTGG TGACCGTGCC CTCCAGCAGC TTGGGCACCC 700 
AGACCTACAT CTGCAACGTG AATCACAAGC CCAGCAACAC CAAGGTCGAC 7 50 
30 AAGAAAGTTG AGCCCAAATC TTGTGACAAA ACTCACACAT GCCCGCCGTG 800 
CCCAGCACCA GAACTGCTGG GCGGCCGCAT' GAAACAGCTA GAGGACAAGG 850 
TCGAAGAGCT ACTCTCCAAG AACTACCACC TAGAGAATGA AGTGGCAAGA 900 

35 

CTCAAAAAGC TTGTCGGGGA GCGCTAA 927 
(2) INFORMATION FOR SEQ ID NO: 60: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 298 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
15 10 15 

50 Ser lie Ala Thr Asn Ala Tyr Ala Glu Val Gin Leu Val Gin Ser 

20 25 30 

Gly Gly Gly Leu Val Gin Pro Gly Gly Ser Leu Arg Leu Ser Cys 
35 40 " 45 

55 

Ala Ala Ser Gly Tyr Ser Phe Ser Ser His Tyr Met His Trp Val 
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50 55 60 

Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val Gly Tyr He Asp 

65 70 75 

5 

Pro Ser Asn Gly Glu Thr Thr Tyr Asa Gin Lys Phe Lys Gly Arg 

80 85 { 90 

Phe Thr Leu Ser Arg Asp Asn Ser Lys Asn Thr Ala Tyr Leu Gin 
10 95 100 105 

Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ala 
110 115 120 

15 Arg Gly Asp Tyr Arg Tyr Asn Gly Asp Trp Phe Phe Asp Val Trp 

125 130 135 



20 



35 



50 



Gly Gin Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly 

140 145 150 

Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly 
155 160 165 



Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu 

25 170 175 . 180 

Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val 

185 190 195 

30 His Thr Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser Leu 

200 205 210 

Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gin Thr 

215 220 225 



Tyr He Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp 
230 235 240 



Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro 

40 245 250 255 

Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly Arg Met Lys Gin Leu 

260 265 270 

45 Glu Asp Lys Val Glu Glu Leu Leu Ser Lys Asn Tyr His Leu Glu 

275 280 285 

Asn Glu Val Ala Arg Leu Lys Lys Leu Val Gly Glu Arg 

290 295 298 



(2) INFORMATION FOR SEQ ID NO: 61: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6563 base pairs 
55 (B) TYPE: Nucleic Acid 

{C} STRANDEDNESS : Single 
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(D) TOPOLOGY: Linear 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 61: 

5 

GAATTCAACT TCTCCATACT TTGGATAAGG AAATACAGAC ATGAAAAATC 50 
TCATTGCTGA GTTGTTATTT AAGCTTGCCC AAAAAGAAGA AGAGTCGAAT 100 
10 GAACTGTGTG CGCAGGTAGA AGCTTTGGAG ATTATCGTCA CTGCAATGCT 150 
TCGCAATATG GCGCAAAATG ACCAACAGCG GTTGATTGAT CAGGTAGAGG 200 
GGGCGCTGTA CGAGGTAAAG CCCGATGCCA GCATTCCTGA CGACGATACG 250 

15 

GAGCTGCTGC GCGATTACGT AAAGAAGTTA TTGAAGCATC CTCGTCAGTA 300 
AAAAGTTAAT CTTTTCAACA GCTGTCATAA AGTTGTCACG GCCGAGACTT 350 
20 ATAGTCGCTT TGTTTTTATT TTTTAATGTA TTTGTAACTA GAATTCGAGC 400 
TCGGTACCCG GGG ATCCTCT CGAGGTTGAG GTGATTTTAT GAAAAAGAAT 450 
ATCGCATTTC TTCTTGCATC TATGTTCGTT TTTTCTATTG CTACAAACGC 500 

25 

ATACGCTGAT ATCCAGATGA CCCAGTCCCC GAGCTCCCTG TCCGCCTCTG 550 
TGGGCGATAG GGTCACCATC ACCTGCAGGT CAAGTCAAAG CTTAGTACAT 600 
30 GGTATAGGTG CTACGTATTT ACACTGGTAT CAACAGAAAC CAGGAAAAGC 650 
TCCGAAACTA CTGATTTACA AAGTATCCAA TCGATTCTCT GGAGTCCCTT 7 00 
CTCGCTTCTC TGGATCCGGT TCTGGGACGG ATTTCACTCT GACCATCAGC 750 

35 

AGTCTGCAGC CAGAAGACTT CGCAACTTAT TACTGTTCAC AGAGTACTCA 800 
TGTCCCGCTC ACGTTTGGAC AGGGTACCAA GGTGGAGATC AAACGAACTG 850 
40 TGGCTGCACC ATCTGTCTTC ATCTTCCCGC CATCTGATGA GCAGTTGAAA 900 
TCTGGAACTG CTTCTGTTGT GTGCCTGCTG AATAACTTCT ATCCCAGAGA 950 
GGCCAAAGTA CAGTGGAAGG TGGATAACGC CCTCCAATCG GGTAACTCCC 1000 

45 

AGGAGAGTGT CACAGAGCAG GACAGCAAGG ACAGCACCTA CAGCCTCAGC 1050 
AGCACCCTGA CGCTGAGCAA AGCAGACTAC GAGAAACACA AAGTCTACGC 1100 
50 CTGCGAAGTC ACCCATCAGG GCCTGAGCTC GCCCGTCACA AAGAGCTTCA 1150 
ACAGGGGAGA GTGTTAAGCT GATCCTCTAC GCCGGACGCA TCGTGGCCCT 1200 
AGTACGCAAC TAGTCGTAAA AAGGGTATCT AGAGGTTGAG GTGATTTTAT 1250 
GAAAAAGAAT ATCGCATTTC TTCTTGCATC TATGTTCGTT TTTTCTATTG 1300 

275 
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CTACAAACGC GTACGCTGAG GTTCAGCTAG TGCAGTCTGG CGGTGGCCTG 1350 
GTGCAGCCAG GGGGCTCACT CCGTTTGTCC TGTGCAGCTT CTGGCTACTC 1400 

5 

CTTCTCGAGT CACTATATGC ACTGGGTCCG TCAGGCCCCG GGTAAGGGCC 1450 
TGGAATGGGT TGG ATATATT GATCCTTCCA ATGGTGAAAC TACGTATAAT 1500 
10 CAAAAGTTCA AGGGCCGTTT CACTTTATCT CGCGACAACT CCAAAAACAC 1550 
AGCATACCTG CAGATGAACA GCCTGCGTGC TGAGGACACT GCCGf CTATT 1600 
ACTGTGCAAG AG GGG ATT AT CGCTACAATG GTGACTGGTT CTTCGACGTC 1650 

15 

TGGGGTCAAG GAACCCTGGT CACCGTCTCC TCGGCCTCCA CCAAGGGCCC 1700 
ATCGGTCTTC CCCCTGGCAC CCTCCTCCAA GAGCACCTCT GGGGGCACAG 17 50 
20 CGGCCCTGGG CTGCCTGGTC AAGGACTACT TCCCCGAACC GGTGACGGTG 1800 
TCGTGGAACT CAGGCGCCCT GACCAGCGGC GTGCACACCT TCCCGGCTGT 1850 
CCTACAGTCC TCAGGACTCT ACTCCCTCAG CAGCGTGGTG ACCGTGCCCT 1900 

25 

CCAGCAGCTT GGGCACCCAG ACCTACATCT GCAACGTGAA TCACAAGCCC 1950 
AGCAACACCA AGGTCGACAA GAAAGTTGAG CCCAAATCTT GTGACAAAAC 2000 
30 TCACACATGC CCGCCGTGCC CAGCACCAGA ACTGCTGGGC GGCCGCATGA 2 050 
AACAGCTAGA GGACAAGGTC GAAGAGCTAC TCTCCAAGAA CTACCACCTA 2100 
GAGAATGAAG TGGCAAGACT CAAAAAGCTT GTCGGGGAGC GCTAAGCATG 2150 

35 

CGACGGCCCT AGAGTCCCTA ACGCTCGGTT GCCGCCGGGC GTTTTTTATT 2200 
GTTAACTCAT GTTTGACAGC TTATCATCGA TAAGCTTTAA TGCGGTAGTT 2250 
40 TATCACAGTT AAATTGCTAA CGCAGTCAGG CACCGTGTAT GAAATCTAAC 2300 
AATGCGCTCA TCGTCATCCT CGGCACCGTC ACCCTGGATG CTGTAGGCAT 2 3 50 
AGGCTTGGTT ATGCCGGTAC TGCCGGGCCT CTTGCGGGAT ATCGTCCATT 2400 

45 

CCGACAGCAT CGCCAGTCAC TATGGCGTGC TGCTAGCGCT ATATGCGTTG 2450 
ATGCAATTTC TATGCGCACC CGTTCTCGGA GCACTGTCCG ACCGCTTTGG 2500 
50 CCGCCGCCCA GTCCTGCTCG CTTCGCTACT TGGAGCCACT ATCGACTACG 2550 
CGATCATGGC GACCACACCC GTCCTGTGGA TCCTCTACGC CGGACGCATC 2600 
GTGGCCGGCA TCACCGGCGC CACAGGTGCG GTTGCTGGCG CCTATATCGC 2650 
CGACATCACC GATGGGGAAG ATCGGGCTCG CCACTTCGGG CTCATGAGCG 27 00 
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CTTGTTTCGG CGTGGGTATG GTGGCAGGCC CCGTGGCCGG GGGACTGTTG 2750 



GGCGCCATCT CCTTGCACGC ACCATTCCTT GCGGCGGCGG TGCTCAACGG 2800 

5 

CCTCAACCTA CTACTGGGCT GCTTCCTAAT GCAGGAGTCG CATAAGGGAG 2850 



AGCGTCGTCC GATGCCCTTG AGAGCCTTCA ACCCAGTCAG CTCCTTCCGG 2900 
10 TGGGCGCGGG GCATGACTAT CGTCGCCGCA CTTATGACTG TCTTCTTTAT 2950 



CATGCAACTC GTAGGACAGG TGCCGGCAGC GCTCTGGGTC ATTTTCGGCG 3000 
AGGACCGCTT TCGCTGGAGC GCGACGATGA TCGGCCTGTC GCTTGCGGTA 3050 

15 

TTCGGAATCT TGCACGCCCT CGCTCAAGCC TTCGTCACTG GTCCCGCCAC 3100 



CAAACGTTTC GGCGAGAAGC AGGCCATTAT CGCCGGCATG GCGGCCGACG 3150 
20 CGCTGGGCTA CGTCTTGCTG GCGTTCGCGA CGCGAGGCTG GATGGCCTTC 3200 



CCCATTATGA TTCTTCTCGC TTCCGGCGGC ATCGGGATGC CCGCGTTGCA 3250 



GGCCATGCTG TCCAGGCAGG TAGATGACGA CCATCAGGGA CAGCTTCAAG 3300 

25 

GATCGCTCGC GGCTCTTACC AGCCTAACTT CGATCACTGG ACCGCTGATC 3350 



GTCACGGCGA TTTATGCCGC CTCGGCGAGC ACATGGAACG GGTTGGCATG 3400 
30 GATTGTAGGC GCCGCCCTAT ACCTTGTCTG CCTCCCCGCG TTGCGTCGCG 3450 



GTGCATGGAG CCGGGCCACC TCGACCTGAA TGGAAGCCGG CGGCACCTCG 3 500 



CTA&CGGATT CACCACTCCA AGAATTGGAG CCAATCAATT CTTGCGGAGA 3550 

35 

ACTGTGAATG CGCAAACCAA CCCTTGGCAG AACATATCCA TCGCGTCCGC 3600 



CATCTCCAGC AGCCGCACGC GGCGCATCTC GGGCAGCGTT GGGTCCTGGC 3650 
40 CACGGGTGCG CATGATCGTG CTCCTGTCGT TGAGGACCCG GCTAGGCTGG 37 00 



CGGGGTTGCC TTACTGGTTA GCAGAATGAA TCACCGATAC GCGAGCGAAC 37 50 



GTGAAGCGAC TGCTGCTGCA AAACGTCTGC GACCTGAGCA ACAACATGAA 3 800 

45 

TGGTCTTCGG TTTCCGTGTT TCGTAAAGTC TGGAAACGCG GAAGTCAGCG 3 850 



CCCTGCACCA TTATGTTCCG GATCTGCATC GCAGGATGCT GCTGGCTACC 3900 
50 CTGTGGAACA CCTACATCTG TATTAACGAA GCGCTGGCAT TGACCCTGAG 3950 



TGATTTTTCT CTGGTCCCGC CGCATCCATA CCGCCAGTTG TTTACCCTCA 4000 



CAACGTTCCA GTAACCGGGC ATGTTCATCA TCAGTAACCC GTATCGTGAG 4050 

55 

CATCCTCTCT CGTTTCATCG GTATCATTAC CCCCATGAAC AGAAATTCCC 4100 
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CCTTACACGG AGGCATCAAG TGACCAAACA GGAAAAAACC GCCCTTAACA 4150 
TGGCCCGCTT TATCAGAAGC CAGACATTAA CGCTTCTGGA GAAACTCAAC 4200 

5 

GAGCTGGACG CGGATGAACA GGCAGACATC TGTGAATCGC TTCACGACCA 4250 
CGCTGATGAG CTTTACCGCA GCTGCCTCGC GCGTTTCGGT QATGhCQGTQ 4300 
10 AAAACCTCTG ACACATGCAG CTCCCGGAGA CGGTCACAGC TTGTCTGTAA 4350 
GCGGATGCCG GG AGO AG AC A AGCCCGTCAG GGCGCGTCAG CGGGTGTTGG 4400 
CGGGTGTCGG GGCGCAGCCA TGACCCAGTC ACGTAGCGAT AGCGGAGTGT 4450 

15 

ATACTSGCTT AACTATGCGG CATCAGAGCA GATTGTACTG AGAGTGCACC 4500 
ATATGCGGTG TGAAATACCG CACAGATGCG TAAGGAGAAA ATACCGCATC 4550 
20 AGGCGCTCTT CCGCTTCCTC GCTCACTGAC TCGCTGCGCT CGGTCGTTCG 4600 
GCTGCGGCGA GCGGTATCAG CTCACTCAAA GGCGGTAATA CGGTTATCCA 4650 
CAGAATCAGG GGATAACGCA GG AAAGAACA TGTGAGCAAA AGGCCAGCAA 4700 

25 

AAGGCCAGGA ACCGTAAAAA GGCCGCGTTG CTGGCGTTTT TCCATAGGCT 4750 
CCGCCCCCCT GACGAGCATC ACAAAAATCG ACGCTCAAGT CAGAGGTGGC 4800 
30 GAAACCCGAC AGGACTATAA AGATACCAGG CGTTTCCCCC TGGAAGCTCC 4850 
CTCGTGCGCT CTCCTGTTCC GACCCTGCCG CTTACCGGAT ACCTGTCCGC 4900 
CTTTCTCCCT TCGGGAAGCG TGGCGCTTTC TCATAGCTCA CGCTGTAGGT 4950 

35 

ATCTCAGTTC GGTGTAGGTC GTTCGCTCCA AGCTGGGCTG TGTGCACGAA 5000 
CCCCCCGTTC AGCCCGACCG CTGCGCCTTA TCCGGTAACT ATCGTCTTGA 5050 
40 GTCCAACCCG GTAAGACACG ACTTATCGCC ACTGGCAGCA GCCACTGGTA 5100 
ACAGGATTAG CAGAGCGAGG TATGTAGGCG GTGCTACAGA GTTCTTGAAG 5150 
TGGTGGCCTA ACTACGGCTA CACTAGAAGG ACAGTATTTG GTATCTGCGC 5200 

45 

TCTGCTGAAG CCAGTTACCT TCGGAAAAAG AGTTGGTAGC TCTTGATCCG 5250 
GCAAACAAAC CACCGCTGGT AGCGGTGGTT TTTTTGTTTG CAAGCAGCAG 5300 
50 ATTACGCGCA GAAAAAAAGG ATCTCAAGAA GATCCTTTGA TCTTTTCTAC 5350 
GGGGTCTGAC GCTCAGTGGA ACGAAAACTC ACGTTAAGGG ATTTTGGTCA 5400 
TGAGATTATC AAAAAGGATC TTCACCTAGA TCCTTTTAAA TTAAAAATGA 5450 

55 

AGTTTTAAAT CAATCTAAAG TATATATGAG TAAACTTGGT CTGACAGTTA 5500 
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CCAATGCTTA ATCAGTGAGG CACCTATCTC AGCGATCTGT CTATTTCGTT 5550 

CATCCATAGT TGCCTGACTC CCCGTCGTGT AGATAACTAC GATACGGGAG 5600 

5 

GGCTTACCAT CTGGCCCCAG TGCTGCAATG ATACCGCGAG ACCCACGCTC 5650 



ACCGGCTCCA GATTTATCAG CAATAAACCA GCCAGCCGGA AGGGCCGAGC 5700 
10 GCAGAAGTGG TCCTGCAACT TTATCCGCCT CCATCCAGTC TATTAATTGT 5750 
TGCCGGGAAG CTAGAGTAAG TAGTTCGCCA GTTAATAGTT TGCGCAACGT 5800 



TGTTGCCATT GCTGCAGGCA TCGTGGTGTC ACGCTCGTCG TTTGGTATGG 5850 

15 

CTTCATTCAG CTCCGGTTCC CAACGATCAA GGCGAGTTAC ATGATCCCCC 5900 



ATGTTGTGCA AAAAAGCGGT TAGCTCCTTC GGTCCTCCGA TCGTTGTCAG 5950 
20 AAGTAAGTTG GCCGCAGTGT TATCACTCAT GGTTATGGCA GCACTGCATA 6000 



ATTCTCTTAC TGTCATGCCA TCCGTAAGAT GCTTTTCTGT GACTGGTGAG 6050 



TACTCAACCA AGTCATTCTG AGAATAGTGT ATGCGGCGAC CGAGTTGCTC 6100 

25 

TTGCCCGGCG TCAACACGGG ATAATACCGC GCCACATAGC AGAACTTTAA 6150 

AAGTGCTCAT CATTGGAAAA CGTTCTTCGG GGCGAAAACT CTCAAGGATC 6200 

30 TTACCGCTGT TGAGATCCAG TTCGATGTAA CCCACTCGTG CACCCAACTG 6250 
ATCTTCAGCA TCTTTTACTT TCACCAGCGT ' TTCTGGGTGA GCAAAAACAG 6300 

GAAGGCAAAA TGCCGCAAAA AAGGGAATAA GGGCGACACG GAAATGTTGA 6350 

35 

ATACTCATAC TCTTCCTTTT TCAATATTAT TGAAGCATTT ATCAGGGTTA 6400 



TTGTCTCATG AGCGGATACA TATTTGAATG TATTTAGAAA AATAAACAAA 6450 
40 TAGGGGTTCC GCGCACATTT CCCCGAAAAG TGCCACCTGA CGTCTAAGAA 6500 
AC C ATT ATT A TCATGACATT AACCTATAAA AATAGGCGTA TCACGAGGCC 6550 



CTTTCGTCTT CAA 6563 

45 

(2) INFORMATION FOR SEQ ID NO: 62: 

(ij SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 242 amino acids 
50 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

55 Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
15 10 15 
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10 



25 



40 



Ser lie Ala Thr Asn Ala Tyr Ma Asp He Gin Met Thr Gin Ser 

20 25 30 

Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg Val Thr He Thr 

35 40 45 

Cys Arg Ser Ser Gin Ser Leu Val His Gly He Gly Glu Thr Tyr 

50 55 60 

Leu His Trp Tyr Gin Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu 

65 70 75 



He Tyr Lys Val Ser Asn Arg Phe Ser Gly Val Pro Ser Arg Phe 

15 80 85 90 

Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr lie Ser Ser 

95 100 105 

20 Leu Gin Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Ser Gin Ser Thr 

110 115 120 



His Val Pro Leu Thr Phe Gly Gin Gly Thr Lys Val Glu He Lys 

125 130 135 

Arg Thr Val Ala Ala Pro Ser Val Phe He Phe Pro Pro Ser Asp 

140 145 150 



Glu Gin Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn 

30 155 160 165 

Asn Phe Tyr Pro Arg Glu Ala Lys Val Gin Trp Lys Val Asp Asn 

170 175 180 

35 Ala Leu Gin Ser Gly Asn Ser Gin Glu Ser Val Thr Glu Gin Asp 

185 190 195 

Ser Lys Asp Ser Thr Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser 

200 205 210 



Lys Ala Asp Tyr Glu Lys His Lys Val Tyr Ala Cys Glu Val Thr 
215 220 225 



His Gin Gly Leu Ser Ser Pro Val Thr Lys Ser Phe Asn Arg Gly 
45 230 235 240 

Glu Cys 
242 

50 (2} INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: Nucleic Acid 
55 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 
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{xi} SEQUENCE DESCRIPTION: SEQ ID MC:63 : 



5 CATGGTATAG GTTAAACTTA TTTACAC 27 

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 27 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 
{D| TOPOLOGY : Linear 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 



CATGGTATAG GTNNSACTTA TTTACAC 27 
20 (2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 780 base pairs 

(B) TYPE: Nucleic Acid 
25 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

30 

ATGAAAAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 50 
TGCTACAAAC GCATACGCTG ATATCCAGAT GACCCAGTCC CCGAGCTCCC 100 
35 TGTCCGCCTC TGTGGGCGAT AGGGTCACCA TCACCTGCAG GTCAAGTCAA 150 
AGCTTAGTAC ATGGTATAGG TGAGACGTAT TTACACTGGT ATCAACAGAA 200 
ACCAGGAAAA GCTCCGAAAC TACTGATTTA CAAAGTATCC AATCGATTCT 2 50 

40 

CTGGAGTCCC TTCTCGCTTC TCTGGATCCG GTTCTGGGAC GGATTTCACT 300 
CTGACCATCA GCAGTCTGCA GCCAGAAGAC TTCGCAACTT ATTACTGTTC 350 
45 ACAGAGTACT CATGTCCCGC TCACGTTTGG ACAGGGTACC AAGGTGGAGA 400 
TCAAACGAAC TGTGGCTGCA CCATCTGTCT TCATCTTCCC GCCATCTGAT 450 
GAGCAGTTGA AATCTGGAAC TGCTTCTGTT GTGTGCCTGC TGAATAACTT 500 

50 

CTATCCCAGA GAGGCCAAAG TACAGTGGAA GGTGGATAAC GCCCTCCAAT 550 
CGGGTAACTC CCAGGAGAGT GTCACAGAGC AGGACAGCAA GG AC AG CACC 600 
55 TACAGCCTCA GCAGCACCCT GACGCTGAGC AAAGCAGACT ACGAGAAACA 650 
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CAAAGTCTAC GCCTGCGAAG TCACCCATCA GGGCCTGAGC TCGCCCGTCA 700 

CAAAGAGCTT CAACAGGGGA GAGTGTTAAG CTGATCCTCT ACGCCGGACG 7 50 

5 CATCGTGGCC CTAGTACGCA ACTAGTCGTA 780 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 78 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY; Linear 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:66: 

CTAGTGCAGT CTGGCGGTGG CCTGGTGCAG CCAGGGGGCT CACTCCGTTT 50 

20 GTCCTGTGCA GCTTCTGGCT ACTCCTTC 78 

}] (2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 
j?25 (A) LENGTH: 82 base pairs 

j =? lj (B) TYPE: Nucleic Acid 

ill {C} STRANDEDNESS: Single 

Ljl (D) TOPOLOGY: Linear 

^30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:67 : 

M TCGAGAAGGA GTAGCCAGAA GCTGCACAGG ACAAACGGAG TGAGCCCCCT 50 

,-35 GGCTGCACCA GGCCACCGCC AGACTGCACT AG 82 

(2) INFORMATION FOR SEQ ID NO:68: 

(i) SEQUENCE CHARACTERISTICS : 
40 (A) LENGTH : 8120 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY : Linear 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

TTCGAGCTCG CCCGACATTG ATTATTGACT AGAGTCGATC GACAGCTGTG 50 

50 GAATGTGTGT CAGTTAGGGT GTGGAAAGTC CCCAGGCTCC CCAGCAGGCA 100 

GAAGTATGCA AAGCATGCAT CTC AATTAGT CAGCAACCAG GTGTGGAAAG 150 

TCCCCAGGCT CCCCAGCAGG CAGAAGTATG CAAAGCATGC ATCTCAATTA 200 

55 

GTCAGCAACC ATAGTCCCGC CCCTAACTCC GCCCATCCCG CCCCTAACTC 250 
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CGCCCAGTTC CGCCCATTCT CCGCCCCATG GCTGACTAAT TTTTTTTATT 300 
TATGCAGAGG CCGAGGCCGC CTCGGCCTCT GAGCTATTCC AGAAG TAG TG 350 

5 

AGGAGGCTTT TTTGGAGGCC TAGGCTTTTG CAAAAAGCTA GCTTATCCGG 400 
CCGGGAACGG TGCATTGGAA CGCGGATTCC CCGTGCCAAG AGTGACGTAA 450 
10 GTACCGCCTA TAGAGCGATA AGAGGATTTT ATCCCCGCTG CCATCATGGT 500 
TCGACCATTG AACTGCATCG TCGCCGTGTC CCAAAATATG GGGATTGGCA 550 
AGAACGGAGA CCTACCCTGG CCTCCGCTCA GGAACGAGTT CAAGTACTTC 60Q 

15 

CAAAGAATGA CCACAACCTC TTCAGTGGAA GGTAAACAGA ATCTGGTGAT 650 

TATGGGTAGG AAAACCTGGT TCTCCATTCC TGAGAAGAAT CGACCTTTAA 700 

_ 20 AGGACAGAAT TAATATAGTT CTCAGTAGAG AACTCAAAGA ACCACCACGA 750 

GGAGCTCATT TTCTTGCCAA AAGTTTGGAT GATGCCTTAA GACTTATTGA 800 

iiij ACAACCGGAA TTGGCAAGTA AAGTAGACAT GGTTTGGATA GTCGGAGGCA 850 
'.-■J 25 

% GTTCTGTTTA CCAGGAAGCC ATGAATCAAC CAGGCCACCT TAGACTCTTT 900 

Ui GTGACAAGGA TCATGCAGGA ATTTGAAAGT GACACGTTTT TCCCAGAAAT 950 

„ 30 TGATTTGGGG AAATATAAAC CTCTCCCAGA ATACCCAGGC GTCCTCTCTG 1000 

f~ AGGTCCAGGA GGAAAAAGGC ATCAAGTATA AGTTTGAAGT CT AC G AGAAG 1050 

ill AAAGACTAAC AGGAAGATGC TTTCAAGTTC TCTGCTCCCC TCCTAAAGCT 1100 

J! 35 

□ ATGCATTTTT ATAAGACCAT GGGACTTTTG CTGGCTTTAG ATCCCCTTGG 1150 

CTTCGTTAGA ACGCAGCTAC AATTAATACA TAACCTTATG TATCATACAC 1200 
40 ATACGATTTA GGTGACACTA TAGATAACAT CCACTTTGCC TTTCTCTCCA 1250 
CAGGTGTCCA CTCCCAGGTC CAACTGCACC TCGGTTCTAT CGATTGAATT 1300 
CCACCATGGG ATGGTCATGT ATCATCCTTT TTCTAGTAGC AACTGCAACT 1350 

45 

GGAGTACATT CAGAAGTTCA GCTAGTGCAG TCTGG CGGTG GCCTGGTGCA 1400 
GCCAGGGGGC TCACTCCGTT TGTCCTGTGC AGCTTCTGGC TACTCCTTCT 1450 
50 CGAGTCACTA TATGCACTGG GTCCGTCAGG CCCCGGGTAA GGGCCTGGAA 1500 
TGGGTTGGAT ATATTGATCC TTCCAATGGT GAAACTACGT ATAATCAAAA 1550 
GTTCAAGGGC CGTTTCACTT TATCTCGCGA CAACTCCAAA AACACAGCAT 1600 

55 

ACCTGCAGAT GAACAGCCTG CGTGCTGAGG ACACTGCCGT CTATTACTGT 1650 
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GCAAGAGGGG ATTATCGCTA CAATGGTGAC TGGTTCTTCG ACGTCTGGGG 1700 
TCAAGGAACC CTGGTCACCG TCTCCTCGGC CTCCACCAAG GGCCCATCGG 1750 

5 

TCTTCCCCCT GGCACCCTCC TCCAAGAGCA CCTCTGGGGG CACAGCGGCC 1800 



CTGGGCTGCC TGGTCAAGGA CTACTTCCCC GAACCGGTGA CGGTGTCGTG 1B50 



10 GAACTCAGGC GCCCTGACCA GCGGCGTGCA CACCTTCCCG GCTGTCCTAC 1900 



AGTCCTCAGG ACTCTACTCC CTCAGCAGCG TGGTGACTGT GCCCTCTAGC 1950 



AGCTTGGGCA CCCAGACCTA CATCTGCAAC GTGAATCACA AGCCCAGCAA 2000 

15 

CACCAAGGTG GACAAGAAAG TTGAGCCCAA ATCTTGTGAC AAAACTCACA 2050 



CATGCCCACC GTGCCCAGCA CCTGAACTCC TGGGGGGACC GTCAGTCTTC 2100 



20 CTCTTCCCCC CAAAACCCAA GGACACCCTC ATGATCTCCC GGACCCCTGA 2150 



GGTCACATGC GTGGTGGTGG ACGTGAGCCA CGAAGACCCT GAGGTCAAGT 2200 



TCAACTGGTA CGTGGACGGC GTGGAGGTGC ATAATGCCAA GACAAAGCCG 2250 

25 

CGGGAGGAGC AGTACAACAG CACGTACCGT GTGGTCAGCG TCCTCACCGT 2300 



CCTGCACCAG GACTGGCTGA ATGGCAAGGA GTACAAGTGC AAGGTCTCCA 2350 



30 ACAAAGCCCT CCCAGCCCCC ATCGAGAAAA CCATCTCCAA AGCCAAAGGG 2400 



CAGCCCCGAG AACCACAGGT GTACACCCTG CCCCCATCCC GGGAAGAGAT 2450 



GACQAAGAAC CAGGTCAGCC TGACCTGCCT GGTCAAAGGC TTCTATCCCA 2500 

35 

GCGACATCGC CGTGGAGTGG GAGAGCAATG GGCAGCCGGA GAACAACTAC 2550 



AAGACCACGC CTCCCGTGCT GGACTCCGAC GGCTCCTTCT TCCTCTACAG 2600 



40 CAAGCTCACC GTGGACAAGA GCAGGTGGCA GCAGGGGAAC GTCTTCTCAT 2650 



GCTCCGTGAT GCATGAGGCT CTGCACAACC ACTACACGCA GAAGAGCCTC 2700 



TCCCTGTCTC CGGGTAAATG AGTGCGACGG CCCTAGAGTC GACCTGCAGA 2750 

45 

AGCTTGGCCG CCATGGCCCA ACTTGTTTAT TGCAGCTTAT AATGGTTACA 2800 



AATAAAGCAA TAGCATCACA AATTTCACAA ATAAAGCATT TTTTTCACTG 2850 



50 CATTCTAGTT GTGGTTTGTC CAAACTCATC AATGTATCTT ATCATGTCTG 2900 



GATCGATCGG GAATTAATTC GGCGCAGCAC CATGGCCTGA AATAACCTCT 2950 



GAAAGAGGAA CTTGGTTAGG TACCTTCTGA GGCGG AAAGA ACCATCTGTG 3000 

55 

GAATGTGTGT CAGTTAGGGT GTGGAAAGTC CCGAGGCTCC CCAGCAGGCA 3050 
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GAAGTATGCA AAGCATGCAT CTCAATTAGT CAGCAACCAG GTGTGGAAAG 3100 



TCCCCAGGCT CCCCAGCAGG CAGAAGTATG CAAAGCATGC ATCTCAATTA 3150 

5 

GTCAGCAACC ATAGTCCCGC CCCTAACTCC GCCCATCCCG CCCCTAACTC 3200 



CGCCCAGTTC CGCCCATTCT CCGCCCCATG GCTGACTAAT TTTTTTTATT 3250 



10 TATGCAGAGG CCGAGGCCGC CTCGGCCTCT GAGCTATTCC AGAAGTAGTG 3300 



AGGAGGCTTT TTTGGAGGCC TAGGCTTTTG CAAAAAGCTA GCTTATCCGG 3350 



CCGGG AACGG TGCATTGGAA CGCGGATTCC CCGTGCCAAG AGTCAGGTAA 3400 

15 

GTACCGCCTA TAGAGTCTAT AGGCCCACCC CCTTGGCTTC GTTAGAACGC 3450 



GGCTACAATT AATACATAAC CTTTTGGATC GATCCTACTG ACACTGACAT 3500 



20 CCACTTTTTC TTTTTCTCCA CAGGTGTCCA CTCCCAGGTC CAACTGCACC 3 550 



TCGGTTCGCG AAGCTAGCTT GGGCTGCATC GATTGAATTC CACCATGGGA 3 600 



TGGTCATGTA TCATCCTTTT TCTAGTAGCA ACTGCAACTG GAGTACATTC 3650 

25 

AGATATCCAG ATGACCCAGT CCCCGAGCTC CCTGTCCGCC TCTGTGGGCG 3700 



ATAGGGTCAC CATCACCTGC AGGTCAAGTC AAAGCTTAGT ACATGGTATA 3750 



30 GGTGCTACGT ATTTACACTG GTATCAACAG AAACCAGGAA AAGCTCCGAA 3800 



ACTACTGATT TACAAAGTAT CCAATCGATT CTCTGGAGTC CCTTCTCGCT 3850 



TCTCTGGATC CGGTTCTGGG ACGGATTTCA CTCTGACCAT CAGCAGTCTG 3900 

35 

CAGCCAGAAG ACTTCGCAAC TTATTACTGT TCACAGAGTA CTCATGTCCC 3 9 50 



GCTCACGTTT GGACAGGGTA CCAAGGTGGA GATCAAACGA ACTGTGGCTG 4000 



40 CACCATCTGT CTTCATCTTC CCGCCATCTG ATGAGCAGTT GAAATCTGGA 4050 



ACTGCTTCTG TTGTGTGCCT GCTGAATAAC TTCTATCCCA GAGAGGCCAA 4100 



AGTACAGTGG AAGGTGGATA ACGCCCTCCA ATCGGGTAAC TCCCAGGAGA 4150 

45 

GTGTCACAGA GCAGGACAGC AAGGACAGCA CCTACAGCCT CAGCAGCACC 4200 



CTGACGCTGA GCAAAGCAGA CTACGAGAAA CACAAAGTCT ACGCCTGCGA 4250 



50 AGTCACCCAT CAGGGCCTGA GCTCGCCCGT CACAAAGAGC TTCAACAGGG 4300 



GAGAGTGTTA AGCTTGGCCG CCATGGCCCA ACTTGTTTAT TGCAGCTTAT 4350 



AATGGTTACA AATAAAGCAA TAGCATCACA AATTTCACAA ATAAAGCATT 4400 

55 

TTTTTCACTG CATTCTAGTT GTGGTTTGTC CAAACTCATC AATGTATCTT 4450 
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ATCATGTCTG GATCGATCGG GAATTAATTC GGCGCAGCAC CATGGCCTGA 4500 
AATAACCTCT GAAAGAGGAA CTTGGTTAGG TACCTTCTGA GGCGGAAAGA 4550 

5 

ACCAGCTGTG GAATGTGTGT CAGTTAGGGT GTGGAAAGTC CCCAGGCTCC 4600 
CCAGCAGGCA GAAGTATGCA AAGCATGCAT CTCAATTAGT CAGCAACCAG 4650 
10 GTGTGGAAAG TCCCCAGGCT CCCCAGCAGG CAGAAGTATG CAAAGCATGC 4700 
ATCTCAATTA GTCAGCAACC ATAGTCCCGC CCCTAACTCC GCCCATCCCG 4750 
CCCCTAACTC CGCCCAGTTC CGCCCATTCT CCGCCCCATG GCTGACTAAT 4 BOO 

15 

TTTTTTTATT TATGCAGAGG CCGAGGCCGC CTCGGCCTCT GAGCTATTCC 4850 
AGAAGTAGTG AGGAGGCTTT TTTGGAGGCC TAGGCTTTTG CAAAAAGCTG 4900 
r 20 TTACCTCGAG CGGCCGCTTA ATTAAGGCGC GCCATTTAAA TCCTGCAGGT 4950 
^jj AACAGCTTGG CACTGGCCGT CGTTTTACAA CGTCGTGACT GGGAAAACCC 5000 

"l* TGGCGTTACC CAACTTAATC GCCTTGCAGC ACATCCCCCC TTCGCCAGCT 5050 

;^25 

::; ;: ggcgtaatag cgaagaggcc cgcaccgatc gcccttccca acagttgcgt 5100 



AGCCTGAATG GCGAATGGCG CCTGATGCGG TATTTTCTCC TTACGCATCT 5150 



;ii 30 GTGCGGTATT TCACACCGCA TACGTCAAAG CAACCATAGT ACGCGCCCXG 5200 

f'7 TAGCGGCGCA TTAAGCGCGG CGGGTGTGGT GGTTACGCGC AGCGTGACCG 52 50 

HI CTACACTTGC CAGCGCCCTA GCGCCCGCTC CTTTCGCTTT CTTCCCTTCC 5300 

iJB5 

r;| TTTCTCGCCA CGTTCGCCGG CTTTCCCCGT CAAGCTCTAA ATCGGGGGCT 53 50 

CCCTTTAGGG TTCCGATTTA GTGCTTTACG GCACCTCGAC CCCAAAAAAC 5400 
40 TTGATTTGGG TGATGGTTCA CGTAGTGGGC CATCGCCCTG ATAGACGGTT 5450 
TTTCGCCCTT TGACGTTGGA GTCCACGTTC TTTAATAGTG GACTCTTGTT 5500 
CCAAACTGGA ACAACACTCA ACCCTATCTC GGGCTATTCT TTTGATTTAT 5550 

45 

AAGGGATTTT GCCGATTTCG GCCTATTGGT TAAAAAATGA GCTGATTTAA 5600 
CAAAAATTTA ACGCGAATTT TAACAAAATA TTAACGTTTA CAATTTTATG 5650 
50 GTGCACTCTC AGTACAATCT GCTCTGATGC CGCATAGTTA AGCCAACTCC 5700 
GCTATCGCTA CGTGACTGGG TCATGGCTGC GCCCCGACAC CCGCCAACAC 57 50 
CCGCTGACGC GCCCTGACGG GCTTGTCTGC TCCCGGCATC CGCTTACAGA 5800 

55 

CAAGCTGTGA CCGTCTCCGG GAGCTGCATG TGTCAGAGGT TTTCACCGTC 5850 
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ATCACCGAAA CGCGCGAGGC AGTATTCTTG AAGACGAAAG GGCCTCGTGA 5900 
TACGCCTATT TTTATAGGTT AATGTCATGA TAATAATGGT TTCTTAGACG 5950 

5 

TCAGGTGGCA CTTTTCGGGG AAATGTGCGC GGAACCCCTA TTTGTTTATT 6000 
TTTCTAAATA CATTCAAATA TGTATCCGCT CATGAGACAA TAACCCTGAT 6050 
10 AAATGCTTCA ATAATATTGA AAAAGGAAGA GTATCAGTAT TCAACATTTC 6100 
CGTGTCGCCC TTATTCCCTT TTTTGCGGCA TTTTGCCTTC CTGTTTTTGC 6150 
TCACCCAGAA ACGCTGGTGA AAGTAAAAGA TGCTGAAGAT CAGTTGGGTG 6200 

15 

CACGAGTGGG TTACATCGAA CTGGATCTCA ACAGCGGTAA GATCCTTGAG 6250 
AGTTTTCGCC CCGAAGAACG TTTTCCAATG ATGAGCACTT TTAAAGTTCT 6300 
20 GCTATGTGGC GCGGTATTAT CCCGTGATGA CGCCGGGCAA GAGCAACTCG 6350 
GTCGCCGCAT ACACTATTCT CAGAATGACT TGGTTGAGTA CTCACCAGTC 6400 
ACAGAAAAGC ATCTTACGGA TGGCATGACA GTAAGAGAAT TATGCAGTGC 6450 

25 

TGCCATAACC ATGAGTGATA ACACTGCGGC CAACTTACTT CTGACAACGA 6500 
TCGGAGGACC GAAGGAGCTA ACCGCTTTTT TGCACAACAT GGGGGATCAT 6550 
30 GTAACTCGCC TTGATCGTTG GGAACCGGAG CTGAATGAAG CCATACCAAA 6600 
CGACGAGCGT GACACCACGA TGCCAGCAGC AATGGCAACA ACGTTGCGCA 6650 
AACTATTAAC TGGCGAACTA CTTACTCTAG CTTCCCGGCA ACAATTAATA 6700 

35 

GACTGGATGG AGGCGGATAA AGTTGCAGGA CCACTTCTGC GCTCGGCCCT 6750 
TCCGGCTGGC TGGTTTATTG CTGATAAATC TGGAGCCGGT GAGCGTGGGT 6800 
40 CTCGCGGTAT CATTGCAGCA CTGGGGCCAG ATGGTAAGCC CTCCCGTATC 6850 
GTAGTTATCT ACACGACGGG GAGTCAGGCA ACTATGGATG AACGAAATAG 6900 
ACAGATCGCT GAGATAGGTG CCTCACTGAT TAAG C ATTGG TAACTGTCAG 6950 

45 

ACCAAGTTTA CTCATATATA CTTTAGATTG ATTTAAAACT TCATTTTTAA 7000 
TTTAAAAGGA TCTAGGTGAA GATCCTTTTT GATAATCTCA TGACCAAAAT 7050 
50 CCCTTAACGT GAGTTTTCGT TCCACTGAGC GTCAGACCCC GTAGAAAAGA 7100 
TCAAAGGATC TTCTTGAGAT CCTTTTTTTC TGCGCGTAAT CTGCTGCTTG 7150 
CAAACAAAAA AACCACCGCT ACCAGCGGTG GTTTGTTTGC CGGATCAAGA 7200 
GCTACCAACT CTTTTTCCGA AGGTAACTGG CTTCAGCAGA GCGCAGATAC 7250 
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CAAATACTGT CCTTCTAGTG TAGCCGTAGT TAGGCCACCA CTTCAAGAAC 7300 
TCTGTAGCAC CGCCTACATA CCTCGCTCTG CTAATCCTGT TACCAGTGGC 7350 

5 

TGCTGCCAGT GGCGATAAGT CGTGTCTTAC CGGGTTGGAC TCAAGACGAT 7400 
AGTTACCGGA TAAGGCGCAG CGGTCGGGCT GAACGGGGGG TTCGTGCACA 7450 
10 CAGCCCAGCT TGGAGCGAAC GACCTACACC GAACTGAGAT ACCTACAGCG 7500 
TGAGCATTGA GAAAGCGCCA CGCTTCCCGA AGGGAGAAAG GCGGACAGGT 7550 
ATCCGGTAAG CGGCAGGGTC GGAACAGGAG AGCGCACGAG GGAGCTTCCA 7600 

15 

GGGGGAAACG CCTGGTATCT TTATAGTCCT GTCGGGTTTC GCCACCTCTG 7 650 

ACTTGAGCGT CGATTTTTGT GATGCTCGTC AGGGGGGCGG AGCCTATGGA 7700 

r 20 AAAACGCCAG CAACGCGGCC TTTTTACGGT TCCTGGCCTT TTGCTGGCCT 7750 

id I TTTGCTCACA TGTTCTTTCC TGCGTTATCC CCTGATTCTG TGGATAACCG 7 800 

HI TATTACCGCC TTTGAGTGAG CTGATACCGC TCGCCGCAGC CGAACGACCG 78 50 
□ |25 

?\ : AGCGCAGCGA GTCAGTGAGC GAGGAAGCGG AAGAGCGCCC AATACGCAAA 7900 

;!f CCGCCTCTCC CCGCGCGTTG GCCGATTCAT TAATCCAGCT GGCACGACAG 7 950 



- 30 GTTTCCCGAC TGGAAAGCGG GCAGTGAGCG CAACGCAATT AATGTGAGTT 8000 

; bs l ACCTCACTCA TTAGGCACCC CAGGCTTTAC ACTTTATGCT TCCGGCTCGT 8050 

[ l! ATGTTGTGTG GAATTGTGAG CGGATAACAA TTTCACACAG GAAACAGCTA 8100 

^35 

□ TGACCATGAT TACGAATTAA 8120 

(2) INFORMATION FOR SEQ ID NO: 69: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 800 base pairs 

(B) TYPE: Nucleic Acid 
<C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 



AAAAGGGTAT CTAGAGGTTG AGGTGATTTT ATGAAAAAGA ATATCGCATT 50 

50 

TCTTCTTGCA TCTATGTTCG TTTTTTCTAT TG C T AC AAA C GCGTACGCTG 100 

AGGTTCAGCT AGTGCAGTCT GGCGGTGGCC TGGTGCAGCC AGGGGGCTCA 150 

55 CTCCGTTTGT CCTGTGCAGC TTCTGGCTAC TCCTTCTCGA GTCACTATAT 200 
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GCACTGGGTC CGTCAGGCCC CGGGTAAGGG CCTGGAATGG GTTGGATATA 250 
TTGATCCTTC CAATGGTGAA ACTACGTATA ATCAAAAGTT CAAGGGCCGT 300 
5 TTCACTTTAT CTCGCGACAA CTCCAAAAAC ACAGCATACC TGCAGATGAA 350 
CAGCCTGCGT GCTGAGGACA CTGCCGTCTA TTACTGTGCA AGAGGGGATT 400 
ATCGCTACAA TGGTGACTGG TTCTTCGACG TCTGGGGTCA AGGAACCCTG 450 

to 

GTCACCGTCT CCTCGGCCTC CACCAAGGGC CCATCGGTCT TCCCCCTGGC 500 
ACCCTCCTCC AAGAGCACCT CTGGGGGCAC AGCGGCCCTG GGCTGCCTGG 550 
15 TCAAGGACTA CTTCCCCGAA CCGGTGACGG TGTCGTGGAA CTCAGGCGCC 600 
CTGACCAGCG GCGTGCACAC CTTCCCGGCT GTCCTACAGT CCTCAGGACT 650 
CTACTCCCTC AGCAGCGTGG TGACCGTGCC CTCCAGCAGC TTGGGCACCC 700 

20 

AGACCTACAT CTGCAACGTG AATCACAAGC CCAGCAACAC CAAGGTCGAC 750 
AAGAAAGTTG AGCCCAAATC TTGTGACAAA ACTCACACAT GCCCGCCTGA 800 
25 (2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 256 amino acids 

(B) TYPE: Amino Acid 
30 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Met; Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
35 1 5 10 15 

Ser lie Ala Thr Asn Ala Tyr Ala Glu Val Gin Leu Val Gin Ser 
20 25 30 

40 Gly Gly Gly Leu Val Gin Pro Gly Gly Ser Leu Arg Leu Ser Cys 

35 40 , 45 

Ala Ala Ser Gly Tyr Ser Phe Ser Ser His Tyr Met His Trp Val 
50 55 60 

45 

Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val Gly Tyr lie Asp 
65 70 75 

Pro Ser Asn Gly Glu Thr Thr Tyr Asn Gin Lys Phe Lys Gly Arg 
50 80 85 90 

Phe Thr Leu Ser Arg Asp Asn Ser Lys Asn Thr Ala Tyr Leu Gin 

95 100 105 

55 Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ala 

110 115 120 
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Arg Gly Asp Tyr Arg Tyr Asn Gly Asp Trp Phe Phe Asp Val Trp 

125 130 135 

Gly Gin Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly 

140 145 150 

Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly 

155 160 165 

Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu 

170 175 180 

Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val 

185 190 195 

His Thr Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser Leu 

200 205 210 

Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gin Thr 

215 220 225 

Tyr He Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp 

230 235 240 

Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro 

245 250 255 

Pro 
256 

(2) INFORMATION FOR SEQ ID NO: 71: 

fi) SEQUENCE CHARACTERISTICS: 

{A} LENGTH: 452 amino acids 
(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

Glu Val Gin Leu Val Gin Ser Gly Gly Gly Leu Val Gin Pro Gly 

15 10 15 

Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Tyr Ser Phe Ser 

20 25 30 

Ser His Tyr Met His Trp Val Arg Gin Ala Pro Gly Lvs Gly Leu 

35 40 " 45 

Glu Trp Val Gly Tyr He Asp Pro Ser Asn Gly Glu Thr Thr Tyr 

50 55 60 

Asn Gin Lys Phe Lys Gly Arg Phe Thr Leu Ser Arg Asp Asn Ser 

65 70 75 

Lys Asn Thr Ala Tyr Leu Gin Met Asn Ser Leu Arg Ala Glu Asp 
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80 



85 



90 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



Thr Ala Val Tyr Tyr Cys Ala Arg Gly Asp Tyr Arg Tyr Asn Gly 

95 100 105 

Asp Trp Phe Phe Asp Val Trp Gly Gin Gly Thr Leu Val Thr Val 

110 115 120 

Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro 

125 130 135 

Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys Leu 

140 145 150 

Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser 

155 160 165 

Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val Leu Gin 

170 175 180 

Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser 

185 190 195 

Ser Ser Leu Gly Thr Gin Thr Tyr lie Cys Asn Val Asn His Lys 

200 205 210 

Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys 

215 220 225 

Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu 

230 235 240 

Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr 

245 250 255 

Leu Met lie Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp 

260 265 270 

Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp 

275 280 285 

Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gin 

290 295 300 

Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His 

305 310 315 

Gin Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn 

320 325 330 

Lys Ala Leu Pro Ala Pro He Glu Lys Thr He Ser Lys Ala Lvs 

335 340 345 

Gly Gin Pro Arg Glu Pro Gin Val Tyr Thr Leu Pro Pro Ser Arg 

350 355 360 
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Glu Glu Met Thr Lys Asn Gin Val Ser Leu Thr Cys Leu Val Lys 
365 370 375 

Gly Phe Tyr Pro Ser Asp lie Ala Val Glu Trp Glu Ser Asn Gly 
5 380 385 390 

Gin Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser 
395 400 405 

10 Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser 

410 415 420 



15 



30 



45 



Arg Trp Gin Gin Gly Asn Val Phe Ser Cys Ser Val Met His Glu 
425 430 435 

Ala Leu His Asn His Tyr Thr Gin Lys Ser Leu Ser Leu Ser Pro 
440 445 450 



Gly Lys 
20 452 

(2) INFORMATION FOR SEQ ID NO:72 : 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 219 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Asp lie Gin Met Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val 
15 10 15 



Gly Asp Arg Val Thr He Thr Cys Arg Ser Ser Gin Ser Leu Val 

35 20 25 30 

His Gly He Gly Ala Thr Tyr Leu His Trp Tyr Gin Gin Lys Pro 

35 40 45 

40 Gly Lys Ala Pro Lys Leu Leu He Tyr Lys Val Ser Asn Arg Phe 

50 55 60 



Ser Gly Val Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp 
65 70 75 

Phe Thr Leu Thr He Ser Ser Leu Gin Pro Glu Asp Phe Ala Thr 
80 85 90 



Tyr Tyr Cys Ser Gin Ser Thr His Val Pro Leu Thr Phe Gly Gin 

50 95 100 105 

Gly Thr Lys Val Glu He Lys Arg Thr Val Ala Ala Pro Ser Val 

110 115 120 

55 Phe He Phe Pro Pro Ser Asp Glu Gin Leu Lys Ser Gly Thr Ala 

125 130 135 
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10 



Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg Glu Ala Lys 

140 145 150 

Val Gin Trp Lys Val Asp Asn Ala Leu Gin Ser Gly Asn Ser Gin 

155 160 165 

Glu Ser Val Thr Glu Gin Asp Ser Lys Asp Ser Thr Tyr Ser Leu 

170 175 ISO 

Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys His Lys 

185 190 195 



15 



Val Tyr Ala Cys Glu Val Thr His Gin Gly Leu Ser Ser Pro Val 
200 205 210 



Thr Lys Ser Phe Asn Arg Gly Glu Cys 
215 219 



293 



